ORFxxxxx Annotation

ORF01580 polysaccharide biosynthesis protein, putative

ORF01612 conserved hypothetical protein

ORF01613 glycosyl transferase, group 1 family protein

ORF01617 conserved hypothetical protein

ORF01618 conserved hypothetical protein

ORF01621 glycosyl transferase, putative

ORF01622 glycosyl transferase, group 2 family protein

ORF01623 glycosyl transferase, family 8, degenerate

ORF01624 IS1381, transposase OrfB

ORF01625 IS1381, transposase OrfA

ORF01626 glycosyl transferase, family 8

ORF01627 glycosyl transferase, family 8

ORF01628 conserved hypothetical protein

ORF01630 cell wall surface anchor family protein

ORF01635 protease, putative

ORF01643 aminopeptidase PepS (pepS)

ORF01702 peptidase, M20/M25/M40 family

ORF01731 IS1381, transposase OrfA

ORF01732 IS1381, transposase OrfB

ORF01740 tellurite resistance protein TehB (tehB)

ORF01747 methylated-DNA--protein-cysteine S-methyltransferase (ogt)

ORF01749 acetyltransferase, GNAT family

ORF01763 AcuB family protein

ORF01764 branched-chain amino acid ABC transporter, ATP-binding protein (livF)

ORF01765 branched-chain amino acid ABC transporter, ATP-binding protein (livG)

ORF01766 branched-chain amino acid ABC transporter, permease protein

ORF01767 branched-chain amino acid ABC transporter, permease protein (livH)

ORF01769 branched-chain amino acid ABC transporter, amino acid-binding protein

ORF01775 aminotransferase, class I

ORF01779 potassium uptake protein, Trk family

ORF01780 cation uptake protein, Trk family

ORF01824 cobalt transport family protein 

ORF01826 conserved hypothetical protein

ORF01832 peptidase, M20/M25/M40 family 

ORF01845 conserved hypothetical protein 

ORF01848 transcriptional regulator, MerR family 

ORF01853 isochorismatase family protein 

ORF01859 membrane protein

ORF01875 oxidoreductase, aldo/keto reductase family

ORF01880 phospho-2-dehydro-3-deoxyheptonate aldolase

ORF01981 rRNA(guanine-N1-)-methyltransferase, putative

ORF02083 prophage LambdaSa2, DNA replication protein DnaC, putative

ORF02101 Na+/H+ exchanger family protein

ORF02107 membrane protein, putative 

ORF02139 UDP-glucose 4-epimerase (galE) 

ORF02143 lacX protein 

ORF02162 conserved hypothetical protein 

ORF02186 hemolysin precursor, putative 

ORF02192 transcriptional regulator, MerR family 

ORF02195 MutT/nudix family protein 

ORF02228 IS1381, transposase OrfB

ORF02229 IS1381, transposase OrfA

ORF02233 conserved hypothetical protein

ORF02234 conserved hypothetical protein

ORF02276 5-methyltetrahydropteroyltriglutamate--homocysteine methyltransferase (metE)

ORF02278 branched-chain amino acid transport protein AzlC, putative 

ORF02288 glycosyl transferase, family 8

ORF02289 glycosyl transferase, family 8

ORF02341 ribosomal protein L32 (rpmF)

ORF02343 conserved hypothetical protein

ORF02358 sensor histidine kinase

ORF02369 conserved hypothetical protein 

ORF02384 LysM domain protein 

ORF02428 hypoxanthine-guanine phosphoribosyltransferase (hpt)

ORF03011 ribosomal protein L33

ORF03014 ribosomal protein L33
ORFxxxxx Annotation

ORF00064 ribosomal protein S14, putative

ORF00095 D-alanyl-D-alanine carboxypeptidase family protein

ORF00096 N-acetylmuramoyl-L-alanine amidase, family 4 protein

ORF00110 conserved hypothetical protein

ORF00112 DNA repair protein RadA (radA)

ORF00124 permease, putative

ORF00148 glycosyl transferase, group 4 family protein

ORF00154 penicillin-binding protein 4, putative

ORF00157 oligopeptide ABC transporter, permease protein

ORF00206 oligopeptide ABC transporter, oligopeptide-binding protein

ORF00207 oligopeptide ABC transporter, permease protein

ORF00208 oligopeptide ABC transporter, permease protein

ORF00209 peptide ABC transporter, ATP-binding protein 

ORF00210 peptide ABC transporter, ATP-binding protein 

ORF00216 IS1548, transposase 

ORF00226 conserved hypothetical protein 

ORF00232 conserved hypothetical protein

ORF00239 site-specific recombinase, phage integrase family 

ORF00250 conserved hypothetical protein 

ORF00251 conserved hypothetical protein

ORF00289 ABC transporter, ATP-binding protein

ORF00305 NADH oxidase, putative

ORF00317 cell division protein FtsL, putative

ORF00333 conserved hypothetical protein

ORF00383 hydrolase, haloacid dehalogenase-like family 

ORF00430 expressed putative lipoprotein 

ORF00431 transcriptional repressor CopY

ORF00434 membrane protein, putative

ORF00438 transcriptional regulator, Fur family

ORF00442 membrane protein, putative

ORF00445 bioY family protein

ORF00446 AtsA/ElaC family protein 

ORF00468 expressed putative protease 

ORF00469 glycosyl transferase, group 2 family protein

ORF00471 nrdI protein (nrdI)

ORF00473 expressed protein of unknown function 

ORF00474 conserved hypothetical protein 

ORF00507 conserved hypothetical protein 

ORF00525 bioY family protein

ORF00528 thiolase

ORF00531 AMP-binding enzyme domain protein

ORF00548 YGGT family protein

ORF00565 exodeoxyribonuclease VII, small subunit (xseB)

ORF00568 arginine repressor ArgR, putative

ORF00572 expressed putative lipase/acylhydrolase

ORF00573 conserved hypothetical protein 

ORF00586 iron-sulfur cluster-binding protein, putative 

ORF00592 oxidoreductase, short chain dehydrogenase/reductase family 

ORF00604 dipeptidase

ORF00611 voltage-gated chloride channel family protein

ORF00619 prophage LambdaSa1, repressor protein, putative

ORF00622 conserved hypothetical protein

ORF00627 prophage LambdaSa1, antirepressor, putative

ORF00634 conserved hypothetical protein

ORF00648 conserved hypothetical protein

ORF00654 conserved hypothetical protein

ORF00655 conserved hypothetical protein

ORF00656 conserved hypothetical protein

ORF00658 conserved hypothetical protein

ORF00659 conserved hypothetical protein

ORF00660 prophage LambdaSa1, structural protein, putative

ORF00662 conserved hypothetical protein

ORF00663 conserved hypothetical protein 

ORF00664 conserved hypothetical protein 

ORF00665 conserved hypothetical protein 

ORF00666 prophage LambdaSa1, structural protein

ORF00668 conserved hypothetical protein

ORF00669 prophage LambdaSa1, pblA protein, internal deletion

ORF00677 prophage LambdaSa1, lysin, putative

ORF00679 conserved hypothetical protein

ORF00695 transposase OrfB, IS3 family, truncation

ORF00697 conserved hypothetical protein

ORF00707 conserved domain protein

ORF00713 acid phosphatase precursor, class B

ORF00720 transposase OrfB, IS3 family FRAMESHIFT

ORF00721 transposase OrfA, IS3 family

ORF00751 cylA protein (cylA)

ORF00755 cylI protein (cylI)

ORF00760 serine protease, subtilase family, putative POINT MUTATION

ORF00781 transcriptional regulator, LysR family 

ORF00783 regulatory protein, putative

ORF00785 IS1548, transposase

ORF00786 regulatory protein, putative, truncation

ORF00787 D-lactate dehydrogenase (ldhA)

ORF00801 glycosyl transferase, group 1 family protein

ORF00805 conserved hypothetical protein 

ORF00826 phage shock protein C, putative 

ORF00833 conserved hypothetical protein

ORF00845 hydrolase, haloacid dehalogenase-like family

ORF00852 conserved hypothetical protein

ORF00853 expressed putative lipoprotein

ORF00857 IS1548, transposase 

ORF00890 conserved hypothetical protein
ORF00902 conserved hypothetical protein
ORF00926 membrane protein, putative

ORF00927 membrane protein, putative

ORF00987 conserved hypothetical protein
ORF01009 expressed protein of unknown function
ORF01010 lipoyl-binding domain protein
ORF01011 oxidoreductase, putative
ORF01012 conserved hypothetical protein

ORF01024 expressed putative lipoprotein

ORF01061 signal peptidase I, putative

ORF01064 IS1548, transposase

ORF01084 glyoxylase family protein
ORF01104 SatD

ORF01126 conserved hypothetical protein
ORF01191 conserved hypothetical protein

ORF01192 conserved hypothetical protein

ORF01193 glycine cleavage system H protein, putative 

ORF01194 bacterial luciferase family protein
ORF01195 oxidoreductase, FMN-binding
ORF01197 lipoate-protein ligase A family protein
ORF01202 IS861, transposase OrfA

ORF01223 drug resistance transporter, EmrB/QacA family, putative
ORF01224 conserved hypothetical protein
ORF01225 potassium uptake protein, putative 
ORF01237 membrane protein, putative 

ORF01249 dihydroneopterin aldolase (folB) 

ORF01256 polysaccharide deacetylase family protein 

ORF01273 transcriptional regulator, GntR family/potassium uptake protein, TrkA family

ORF01280 conserved hypothetical protein 

ORF01281 conserved hypothetical protein

ORF01289 lipoprotein, putative
ORF01291 conserved hypothetical protein

ORF01298 conserved hypothetical protein

ORF01318 conserved hypothetical protein 

ORF01320 voltage-gated chloride channel family protein, putative 
ORF01322 arsenate reductase (arsC) 

ORF01340 dTDP-glucose 4,6-dehydratase (rfbB) 

ORF01341 dTDP-4-dehydrorhamnose 3,5-epimerase 

ORF01342 glucose-1-phosphate thymidylyltransferase (rfbA)

ORF01356 hypothetical protein

ORF01368 conserved hypothetical protein

ORF01374 ISSdy1, transposase OrfB

ORF01388 transposase OrfA, IS3 family

ORF01389 transposase OrfB, IS3 family, truncation

ORF01391 ISSdy1, transposase OrfB FRAMESHIFT

ORF01396 transcriptional regulator, Cro/CI family 

ORF01419 repressor protein, putative

ORF01461 amino acid permease 

ORF01469 conserved hypothetical protein 

ORF01483 sensor histidine kinase

ORF01485 GTP pyrophosphokinase family protein

ORF01490 5'-nucleotidase family protein

ORF01509 2-dehydropantoate 2-reductase, putative 

ORF01510 regulatory protein, putative 

ORF01522 carbamoyl-phosphate synthase, large subunit, putative 

ORF01542 sulfatase

ORF01549 conserved hypothetical protein 

ORF01554 iron compound ABC transporter, substrate-binding protein 

ORF01557 conserved hypothetical protein 

ORF01563 conserved hypothetical protein TIGR01212

ORF01583 glycosyltransferase, group 2 family protein 

ORF01584 glycosyltransferase, group 2 family protein 

ORF01585 glycosyltransferase, putative 

ORF01586 dTDP-4-dehydrorhamnose reductase (rfbD)

ORF01593 conserved hypothetical protein 

ORF01599 conserved hypothetical protein 

ORF01600 glycerol-3-phosphate transporter, putative 

ORF01639 conserved hypothetical protein 

ORF01650 nitroreductase family protein 

ORF01653 amino acid permease 

ORF01665 transcriptional regulator, MutR family 

ORF01683 MutT/nudix family protein 

ORF01686 67 kDa Myosin-cross reactive streptococcal antigen 
ORF01688 peptide methionine sulfoxide reductase (msrA)
ORF01694 peptide ABC transporter, permease protein 
ORF01704 conserved hypothetical protein 

ORF01705 IS861, transposase OrfA

ORF01741 membrane protein, putative
ORF01770 conserved hypothetical protein

ORF01772 IS1548, transposase

ORF01790 conserved hypothetical protein
ORF01794 conserved hypothetical protein

ORF01800 amino acid ABC transporter, substrate-binding protein

ORF01810 IS1548, transposase

ORF01827 sodium:dicarboxylate symporter family protein

ORF01877 immunogenic secreted protein, putative

ORF01913 transcriptional regulator, Cro/CI family 

ORF01928 membrane protein, putative 

ORF01931 transporter, putative 

ORF01932 transcriptional regulator, Crp/Fnr family 

ORF01947 transcriptional regulator, MerR family

ORF01970 acid phosphatase

ORF02002 amino acid ABC transporter, permease protein 

ORF02028 perfringolysin O regulator protein (pfoR) 

ORF02029 conserved hypothetical protein

ORF02031 expressed protein of unknown function 

ORF02032 expressed protein of unknown function 

ORF02035 deoxynucleoside kinase family protein 

ORF02042 alkyl hydroperoxide reductase, subunit C (ahpC)

ORF02126 transcriptional regulator, MarR family 

ORF02128 N-acetylmuramoyl-L-alanine amidase, family 4 protein 

ORF02135 malate oxidoreductase 

ORF02136 citrate carrier protein, CCS family 

ORF02137 sensor histidine kinase family protein

ORF02138 response regulator 

ORF02166 conserved hypothetical protein 

ORF02169 PTS system, IIB component

ORF02170 PTS system, IIA component, putative

ORF02202 ABC transporter, ATP-binding protein

ORF02262 ABC transporter, ATP-binding protein

ORF02270 cAMP factor (cfb) 

ORF02280 serine protease, subtilase family, putative 

ORF02286 major facilitator family protein 

ORF02292 preprotein translocase, SecE subunit, putative 

ORF02295 Lyme disease proteins of unknown function, putative

ORF02298 Na+ dependent nucleoside transporter 

ORF02301 transcriptional regulator, GntR family 

ORF02313 virulence factor MviM, putative 

ORF02316 membrane protein, putative 

ORF02319 conserved hypothetical protein TIGR00250

ORF02328 transporter, putative 

ORF02331 cold shock protein, CSD family 

ORF02332 DNA mismatch repair protein HexA (hexA) 

ORF02335 conserved hypothetical protein 

ORF02372 conserved hypothetical protein 

ORF02383 expressed putative lipoprotein 

ORF02393 transporter, putative 
ORF02398 transcriptional regulator, Crp/Fnr family
ORF02399 conserved hypothetical protein

ORF02401 acetyltransferase, GNAT family

ORF02403 arginine/ornithine antiporter (arcD)
ORF03002 conserved hypothetical protein, truncation

Table 11: GBS genes not shared with GAS or pneumococcus



ORFxxxxx Annotation 

ORF00008 protease, putative 

ORF00010 acyl carrier protein (acpP)

ORF00016 acetyltransferase, GNAT family

ORF00018 peptidase, M23/M37 family, putative secreted protein

ORF00035 membrane protein, putative

ORF00087 lipoprotein, putative

ORF00088 hypothetical protein 

ORF00089 hypothetical protein 

ORF00091 conserved hypothetical protein 

ORF00117 ribose ABC transporter, periplasmic D-ribose-binding protein (rbsB) 

ORF00118 ribose ABC transporter, permease protein (rbsC)

ORF00120 ribose ABC transporter protein RbsD (rbsD)

ORF00121 ribokinase (rbsK)

ORF00123 hypothetical protein

ORF00130 argininosuccinate lyase (argH)

ORF00137 conserved hypothetical protein

ORF00138 hypothetical protein

ORF00166 4-diphosphocytidyl-2C-methyl-D-erythritol kinase (ispE)

ORF00182 conserved domain protein

ORF00186 transcriptional regulator, Cro/CI family

ORF00187 hypothetical protein 

ORF00188 hypothetical protein 

ORF00192 hypothetical protein 

ORF00193 conserved hypothetical protein

ORF00196 conserved hypothetical protein 

ORF00199 hydrolase, haloacid dehalogenase-like family 

ORF00200 sensor histidine kinase, putative

ORF00201 response regulator 

ORF00203 conserved hypothetical protein 

ORF00204 membrane protein, putative

ORF00205 hypothetical protein

ORF00228 lipoprotein, putative

ORF00234 hypothetical protein

ORF00235 hypothetical protein 

ORF00238 hypothetical protein 

ORF00240 transcriptional regulator, Cro/CI family

ORF00241 hypothetical protein 

ORF00242 conserved hypothetical protein 

ORF00243 hypothetical protein 

ORF00244 conserved domain protein 

ORF00245 conserved hypothetical protein, fusion

ORF00246 replication initiation protein, putative 

ORF00247 hypothetical protein 

ORF00248 recombination protein 

ORF00249 hypothetical protein

ORF00252 conserved hypothetical protein 

ORF00253 hypothetical protein 

ORF00254 hypothetical protein

ORF00255 hypothetical protein 

ORF00256 hypothetical protein 

ORF00257 hypothetical protein

ORF00258 hypothetical protein 

ORF00259 hypothetical protein 

ORF00260 hypothetical protein 

ORF00272 expressed putative lipoprotein

ORF00273 hypothetical protein

ORF00274 hypothetical protein

ORF00275 hypothetical protein

ORF00276 hypothetical protein

ORF00278 membrane protein, putative

ORF00279 transcriptional regulator, Cro/CI family

ORF00280 acetyltransferase, GNAT family 

ORF00281 acetyltransferase, GNAT family 

ORF00283 conserved hypothetical protein 

ORF00284 RNA polymerase sigma factor, ECF subfamily

ORF00285 lipoprotein, putative

ORF00287 transcriptional regulator, TetR family

ORF00288 ABC transporter efflux protein, DrrB family, putative 

ORF00292 hypothetical protein 

ORF00294 expressed protein of unknown function

ORF00298 acyl carrier protein phosphodiesterase, putative 

ORF00308 conserved hypothetical protein 

ORF00324 conserved hypothetical protein 

ORF00332 hypothetical protein 

ORF00340 hypothetical protein 
ORF00347 conserved hypothetical protein 
ORF00384 hypothetical protein 
ORF00402 membrane protein, putative 

ORF00408 hypothetical protein

ORF00409 membrane protein, putative
ORF00414 conserved hypothetical protein

ORF00416 hypothetical protein

ORF00417 hypothetical protein

ORF00433 copper-transporter protein CopZ 

ORF00448 hypothetical protein 

ORF00466 conserved hypothetical protein

ORF00467 acetyltransferase, GNAT family 
ORF00475 conserved domain protein 
ORF00476 hypothetical protein 

ORF00478 carboxymuconolactone decarboxylase family protein 

ORF00479 conserved hypothetical protein 

ORF00486 transcriptional regulator, AraC family

ORF00487 surface protein Rib 

ORF00488 transposase, IS256 family, truncation 

ORF00489 DNA-damage-inducible protein J, putative 

ORF00490 hypothetical protein

ORF00491 lipoprotein, putative

ORF00493 bacteriophage L54a, integrase, truncation
ORF00497 conserved domain protein

ORF00503 oxidoreductase, Gfo/Idh/MocA family

ORF00506 transposase, IS256 family

ORF00510 bacteriocin transport accessory protein, putative

ORF00512 hypothetical protein

ORF00526 biotin synthetase (bioB) 

ORF00527 hypothetical protein 

ORF00533 type IV prepilin peptidase-related protein 

ORF00538 conserved hypothetical protein 

ORF00556 hypothetical protein

ORF00563 expressed protein of unknown function 

ORF00575 hypothetical protein 
ORF00584 conserved hypothetical protein

ORF00585 fructose-1,6-bisphosphatase, putative

ORF00590 carboxymethylenebutenolidase-related protein

ORF00597 conserved hypothetical protein

ORF00598 inosine-uridine preferring nucleoside hydrolase

ORF00599 hypothetical protein

ORF00600 OsmC/Ohr family protein

ORF00608 adenosine deaminase, putative 

ORF00610 chorismate mutase, putative 

ORF00615 prophage LambdaSa1, site-specific recombinase, phage integrase family

ORF00617 conserved domain protein
ORF00618 hypothetical protein
ORF00620 hypothetical protein
ORF00621 conserved hypothetical protein
ORF00623 hypothetical protein
ORF00624 hypothetical protein

ORF00626 prophage LambdaSa1, transcriptional regulator, Cro/CI family

ORF00628 hypothetical protein

ORF00630 hypothetical protein

ORF00632 hypothetical protein

ORF00633 conserved hypothetical protein

ORF00635 hypothetical protein

ORF00636 hypothetical protein
ORF00637 hypothetical protein
ORF00638 conserved hypothetical protein

ORF00639 conserved domain protein
ORF00641 prophage LambdaSa1, reverse transcriptase/maturase family protein

ORF00642 conserved hypothetical protein 

ORF00643 conserved hypothetical protein 
ORF00644 hypothetical protein 
ORF00645 hypothetical protein 

ORF00646 conserved hypothetical protein 

ORF00647 hypothetical protein

ORF00649 hypothetical protein 
ORF00650 hypothetical protein 
ORF00652 conserved hypothetical protein 

ORF00653 conserved hypothetical protein 

ORF00657 conserved hypothetical protein, truncation 

ORF00661 conserved hypothetical protein 

ORF00667 conserved hypothetical protein

ORF00670 prophage LambdaSa1, minor structural protein, putative

ORF00671 prophage LambdaSa1, N-acetylmuramoyl-L-alanine amidase, family 4

ORF00672 prophage LambdaSa1, minor structural protein, putative

ORF00673 hypothetical protein

ORF00674 hypothetical protein 

ORF00675 conserved hypothetical protein 

ORF00676 conserved hypothetical protein 

ORF00678 conserved hypothetical protein 

ORF00681 conserved hypothetical protein 

ORF00682 hypothetical protein 

ORF00683 prophage LambdaSa1, site-specific recombinase, phage integrase family FRAMESHIFT

ORF00685 conserved hypothetical protein 

ORF00689 conserved hypothetical protein, FRAMESHIFT 

ORF00698 hypothetical protein 

ORF00703 phosphoserine phosphatase SerB (serB)

ORF00704 MutT/nudix family protein

ORF00712 hypothetical protein

ORF00718 cell wall surface protein, interruption-N

ORF00723 hypothetical protein

ORF00726 transcriptional regulator, AraC family

ORF00727 expressed cell wall surface anchor family protein
ORF00728 expressed cell wall surface anchor family protein

ORF00735 expressed protein of unknown function

ORF00737 conserved hypothetical protein, degenerate

ORF00738 hypothetical protein 

ORF00740 hypothetical protein 

ORF00741 hypothetical protein 

ORF00742 lipoprotein, putative 

ORF00747 cylD protein (cylD) 

ORF00749 acyl carrier protein AcpC 

ORF00750 cylZ protein FRAMESHIFT 

ORF00752 cylB protein (cylB) 

ORF00753 cylE protein (cylE)

ORF00754 cylF protein (cylF)

ORF00756 cylJ protein (cylJ)

ORF00757 cylK protein (cylK) 

ORF00758 hypothetical protein 

ORF00759 putative secreted protein

ORF00761 hypothetical protein

ORF00766 expressed putative secreted protein

ORF00767 hypothetical protein

ORF00768 conserved domain protein

ORF00769 permease, putative

ORF00775 conserved hypothetical protein

ORF00777 DedA family protein, putative

ORF00779 membrane protein, putative

ORF00788 sodium:galactoside symporter family protein, putative

ORF00791 transcriptional regulator, GntR family

ORF00793 Glucuronate isomerase (uxaC) 

ORF00794 mannonate dehydratase (uxuA) 

ORF00795 D-mannonate oxidoreductase 

ORF00796 hydrolase, haloacid dehalogenase-like family

ORF00797 glycosyl hydrolase, family 3 

ORF00806 conserved hypothetical protein 

ORF00822 ABC transporter, ATP-binding protein

ORF00827 hypothetical protein 

ORF00834 conserved hypothetical protein 

ORF00838 membrane protein, putative

ORF00839 Mn2+/Fe2+ transporter, NRAMP family

ORF00848 conserved domain protein 

ORF00872 cell wall surface anchor family protein 

ORF00874 conserved hypothetical protein 

ORF00878 ABC transporter, permease protein 

ORF00879 YaeC family protein, putative 

ORF00888 hydrolase, haloacid dehalogenase-like family

ORF00891 conserved domain protein

ORF00898 conserved hypothetical protein

ORF00900 permease, GntP family 

ORF00903 transcriptional regulator, MarR family 

ORF00907 glutathione S-transferase family protein 






WO 2004/018646 



PCT/US2003/026827 




ORF00909 hypothetical protein 

ORF00921 membrane protein, putative

ORF00922 glycosyl transferase, family 8

ORF00923 hypothetical protein 

ORF00924 conserved hypothetical protein 

ORF00939 conserved hypothetical protein

ORF00942 expressed putative secreted protein 

ORF00943 hypothetical protein 

ORF00944 hypothetical protein

ORF00946 conserved hypothetical protein

ORF00950 hypothetical protein

ORF00951 transcriptional regulator, TenA family 

ORF00972 ATP synthase F0, C subunit (atpE) 

ORF00980 conserved hypothetical protein

ORF00982 conserved hypothetical protein

ORF01003 conserved hypothetical protein

ORF01004 conserved hypothetical protein

ORF01013 hypothetical protein 

ORF01014 hypothetical protein 

ORF01015 hypothetical protein 

ORF01016 hypothetical protein 

ORF01018 hypothetical protein 

ORF01019 hypothetical protein 

ORF01021 hypothetical protein 

ORF01025 HP domain protein 

ORF01026 acetyltransferase, GNAT family

ORF01032 chloramphenicol acetyltransferase (cat)

ORF01034 Tn916, transposase

ORF01035 Tn916, excisionase

ORF01037 Tn916, hypothetical protein

ORF01038 Tn916, hypothetical protein

ORF01039 Tn916, transcriptional regulator, putative

ORF01041 Tn916, hypothetical protein

ORF01042 Tn916, NLP/P60 family protein

ORF01044 membrane protein, putative FRAMESHIFT

ORF01048 Tn916, hypothetical protein

ORF01049 Tn916, hypothetical protein

ORF01050 Tn916, hypothetical protein

ORF01051 Tn916, transcriptional regulator, putative

ORF01052 Tn916, FtsK/SpoIIIE family protein

ORF01053 Tn916, hypothetical protein

ORF01054 Tn916, hypothetical protein

ORF01062 hypothetical protein

ORF01086 Na+/H+ exchanger family protein

ORF01092 acetyltransferase, GNAT family 

ORF01096 nisin-resistance protein, putative 

ORF01103 conserved hypothetical protein

ORF01124 acetyltransferase, GNAT family

ORF01133 iron-compound ABC transporter, iron-compound-binding protein

ORF01140 conserved hypothetical protein

ORF01142 carbon starvation protein CstA, putative

ORF01143 response regulator

ORF01144 sensor histidine kinase, putative

ORF01145 lipoprotein, putative

ORF01146 conserved hypothetical protein, FRAMESHIFT
ORF01148 lipoprotein, putative

ORF01149 hypothetical protein 

ORF01150 hypothetical protein 

ORF01151 hypothetical protein

ORF01152 lipoprotein, putative

ORF01153 hypothetical protein

ORF01157 conserved hypothetical protein

ORF01158 hypothetical protein

ORF01159 hypothetical protein

ORF01160 expressed protein of unknown function FRAMESHIFT

ORF01161 expressed conserved domain protein

ORF01162 conserved hypothetical protein

ORF01164 FtsK/SpoIIIE family protein FRAMESHIFT

ORF01166 hypothetical protein

ORF01167 conserved hypothetical protein 

ORF01168 conserved hypothetical protein 

ORF01169 hypothetical protein 

ORF01172 phage infection protein, putative

ORF01173 conserved hypothetical protein

ORF01174 conserved domain protein

ORF01175 hypothetical protein 

ORF01182 membrane protein, putative 

ORF01186 cell wall surface anchor family protein, putative

ORF01187 hypothetical protein

ORF01204 hypothetical protein

ORF01215 hypothetical protein

ORF01241 transcriptional regulator, AraC family, putative

ORF01253 rarD protein (rarD)

ORF01257 transporter, BCCT family protein 

ORF01258 hypothetical protein 

ORF01261 expressed protein of unknown function 

ORF01262 conserved hypothetical protein, FRAMESHIFT

ORF01263 hypothetical protein

ORF01265 hypothetical protein

ORF01266 hypothetical protein

ORF01269 conserved hypothetical protein

ORF01272 conserved hypothetical protein

ORF01277 conserved hypothetical protein

ORF01287 conserved hypothetical protein

ORF01288 membrane protein, putative

ORF01299 CMP-N-acetylneuraminic acid synthetase NeuA (neuA) 

ORF01300 neuD protein (neuD)

ORF01301 UDP-N-acetylglucosamine-2-epimerase NeuC (neuC)

ORF01302 N-acetyl neuramic acid synthetase NeuB (neuB) 

ORF01303 polysaccharide biosynthesis protein CpsL (cpsL) 

ORF01304 polysaccharide biosynthesis protein CpsK(V) (cpsK) 

ORF01307 glycosyltransferase CpsN(V) (cpsN)

ORF01308 polysaccharide biosynthesis protein CpsM(V) (cpsM) 
ORF01309 polysaccharide biosynthesis protein cpsH(V) (cpsH) 

ORF01310 glycosyltransferase CpsG(V) (cpsG) 

ORF01311 polysaccharide biosynthesis protein CpsF (cpsF) 

ORF01312 glycosyltransferase CpsE (cpsE) 

ORF01348 conserved domain protein 

ORF01349 hypothetical protein
ORF01370 conserved hypothetical protein 




ORF01371 conserved hypothetical protein 

ORF01372 expressed protein of unknown function

ORF01373 ISSdy1, transposase OrfA

ORF01375 conserved hypothetical protein

ORF01379 transposase OrfB, IS3 family, truncation

ORF01382 GBSi1, group II intron, maturase

ORF01384 hypothetical protein

ORF01385 hypothetical protein

ORF01386 conserved hypothetical protein

ORF01387 conserved hypothetical protein, truncation

ORF01390 ISSdy1, transposase OrfA FRAMESHIFT

ORF01392 hypothetical protein

ORF01393 hypothetical protein

ORF01394 site-specific recombinase, phage integrase family

ORF01395 conserved hypothetical protein

ORF01401 transposase, ISL3 family

ORF01404 mercuric resistance operon regulatory protein MerR (merR)

ORF01408 cadmium efflux system accessory protein (CadC)

ORF01409 conserved hypothetical protein 

ORF01410 hypothetical protein 

ORF01417 hypothetical protein 

ORF01418 hypothetical protein 

ORF01420 hypothetical protein

ORF01421 ImpB/MucB/SamB family protein 

ORF01423 conserved hypothetical protein 

ORF01424 conserved hypothetical protein 

ORF01425 conserved hypothetical protein 

ORF01426 conserved hypothetical protein 

ORF01427 hypothetical protein

ORF01428 conserved hypothetical protein

ORF01430 hypothetical protein

ORF01431 hypothetical protein 

ORF01432 conserved domain protein 

ORF01433 SNF2 family protein 

ORF01434 hypothetical protein 

ORF01435 calcium-binding protein, putative 

ORF01436 agglutinin receptor (ssp-5) 

ORF01437 abortive infection protein AbiGI (abiGI)

ORF01438 abortive infection protein AbiGII (abiGII)

ORF01439 conserved hypothetical protein

ORF01440 expressed protein of unknown function

ORF01441 conserved hypothetical protein, degenerate

ORF01442 membrane protein, putative

ORF01443 hypothetical protein 

ORF01444 Tn5252, Orf 21 protein, internal deletion 

ORF01445 hypothetical protein 

ORF01450 conserved hypothetical protein

ORF01452 hypothetical protein

ORF01454 conserved hypothetical protein 

ORF01459 hypothetical protein 

ORF01460 homocysteine S-methyltransferase MmuM, putative

ORF01463 hypothetical protein 

ORF01464 hypothetical protein 

ORF01465 hypothetical protein

ORF01466 transcriptional regulator, TetR family
ORF01477 glutathione S-transferase family protein, putative

ORF01478 conserved domain protein

ORF01486 hypothetical protein 

ORF01488 R5 protein 

ORF01489 transcriptional regulator, MarR family, putative

ORF01494 membrane protein, putative 

ORF01497 acetyltransferase, GNAT family 

ORF01502 hypothetical protein

ORF01503 conserved hypothetical protein

ORF01508 surface antigen-related protein

ORF01535 conserved hypothetical protein

ORF01547 conserved hypothetical protein

ORF01566 expressed cell wall surface anchor family protein

ORF01572 glycosyltransferase, group 1 family protein

ORF01573 glycosyltransferase, group 2 family protein

ORF01575 membrane protein, putative

ORF01576 glycosyltransferase, group 2 family protein
ORF01577 glycosyltransferase, group 2 family protein
ORF01578 nucleotide sugar dehydratase, putative
ORF01581 lipoprotein, putative
ORF01582 conserved hypothetical protein
ORF01596 ammonium transporter family protein
ORF01597 conserved hypothetical protein
ORF01601 hypothetical protein 

ORF01608 proton/peptide symporter family protein 

ORF01611 hypothetical protein 

ORF01615 conserved domain protein
ORF01638 conserved hypothetical protein
ORF01641 conserved hypothetical protein

ORF01645 cell wall surface anchor family protein

ORF01660 membrane protein, putative
ORF01661 ABC transporter, ATP-binding protein
ORF01666 hypothetical protein

ORF01667 hypothetical protein 

ORF01670 hypothetical protein 

ORF01672 protease, putative, POINT MUTATION 

ORF01673 hypothetical protein 

ORF01674 hypothetical protein 

ORF01675 hypothetical protein

ORF01680 tetracenomycin polyketide synthesis O-methyltransferase TcmP, putative 

ORF01681 hypothetical protein 

ORF01682 hypothetical protein 
ORF01684 hypothetical protein

ORF01692 peptide ABC transporter, ATP-binding protein 
ORF01695 peptide ABC transporter, permease protein
ORF01696 peptide ABC transporter, peptide-binding protein
ORF01699 transposase, IS30 family, putative
ORF01700 transporter, major facilitator family

ORF01703 transcriptional regulator, LysR family 

ORF01715 conserved hypothetical protein
ORF01719 hypothetical protein

ORF01720 conserved hypothetical protein 

ORF01721 glyoxalase family protein

ORF01727 conserved hypothetical protein 

ORF01729 acetyltransferase, GNAT family 
ORF01730 glycosyl transferase, group 2 family protein

ORF01733 hypothetical protein 

ORF01734 conserved hypothetical protein 

ORF01735 hypothetical protein

ORF01736 hypothetical protein 

ORF01737 hypothetical protein 

ORF01742 hypothetical protein 

ORF01743 PTS system component, putative

ORF01744 conserved hypothetical protein 

ORF01748 D-isomer specific 2-hydroxyacid dehydrogenase family protein
ORF01753 conserved hypothetical protein
ORF01754 hypothetical protein

ORF01761 transposase, IS30 family, putative, truncation

ORF01778 amino acid permease, putative 

ORF01807 hypothetical protein

ORF01836 hypothetical protein

ORF01838 hypothetical protein 

ORF01839 dihydroxyacetone kinase family protein 

ORF01840 transcriptional regulator, TetR family, putative 

ORF01842 hypothetical protein
ORF01843 dihydroxyacetone kinase family protein 
ORF01844 dihydroxyacetone kinase family protein 
ORF01847 conserved hypothetical protein 
ORF01850 hypothetical protein

ORF01863 pyruvate phosphate dikinase (ppdK)

ORF01864 expressed protein of unknown function

ORF01865 CBS domain protein

ORF01866 3-hydroxyacyl-CoA dehydrogenase family protein, putative secreted protein 
ORF01892 hypothetical protein 
ORF01893 hypothetical protein 

ORF01894 conserved hypothetical protein 

ORF01895 hypothetical protein 

ORF01896 hypothetical protein 

ORF01897 hypothetical protein

ORF01898 hypothetical protein 

ORF01899 hypothetical protein 

ORF01903 conserved hypothetical protein

ORF01904 drug resistance transporter, EmrB/QacA family

ORF01905 hypothetical protein 

ORF01922 conserved hypothetical protein 

ORF01925 FMN-binding protein 

ORF01934 hypothetical protein

ORF01936 polyprenyl synthetase family protein

ORF01939 cytochrome d ubiquinol oxidase, subunit II (cydB)

ORF01940 cytochrome d oxidase, subunit I (cydA) 

ORF01941 pyridine nucleotide-disulphide oxidoreductase family protein 

ORF01942 prenyltransferase, UbiA family 

ORF01943 hypothetical protein 

ORF01944 hypothetical protein 

ORF01946 cyclopropane-fatty-acyl-phospholipid synthase (cfa) 

ORF01951 conserved hypothetical protein 

ORF01953 hypothetical protein 

ORF01954 conserved hypothetical protein

ORF01984 hypothetical protein 

ORF01988 hypothetical protein 
ORF01989 hypothetical protein
ORF01990 hypothetical protein
ORF01991 hypothetical protein
ORF02000 membrane protein, putative 
ORF02001 transposase, IS30 family, putative 
ORF02005 hypothetical protein 

ORF02006 xylulose-5-phosphate/fructose-6-phosphate phosphoketolase (xfp)

ORF02009 conserved hypothetical protein 

ORF02010 carbohydrate kinase, FGGY family

ORF02011 hypothetical protein

ORF02012 PTS system component, putative

ORF02015 glyoxylate reductase, NADH-dependent

ORF02016 hypothetical protein

ORF02025 hypothetical protein

ORF02026 hypothetical protein

ORF02030 glutamate-cysteine ligase-related protein

ORF02036 phosphinothricin N-acetyltransferase (pat)

ORF02039 conserved hypothetical protein

ORF02044 conserved hypothetical protein 

ORF02045 conserved hypothetical protein 

ORF02046 prophage LambdaSa2, lysin, putative 

ORF02047 prophage LambdaSa2, holin, putative

ORF02048 conserved hypothetical protein

ORF02049 hypothetical protein

ORF02050 conserved domain protein

ORF02051 prophage LambdaSa2, PblB, putative

ORF02053 conserved hypothetical protein

ORF02056 conserved hypothetical protein

ORF02057 hypothetical protein 

ORF02058 hypothetical protein 

ORF02059 conserved hypothetical protein 

ORF02060 conserved hypothetical protein 

ORF02061 hypothetical protein 

ORF02062 hypothetical protein 

ORF02063 conserved domain protein

ORF02064 conserved domain protein

ORF02066 prophage LambdaSa2, protease, putative 

ORF02067 conserved hypothetical protein

ORF02068 prophage LambdaSa2, terminase large subunit, putative
ORF02069 hypothetical protein
ORF02070 hypothetical protein

ORF02071 prophage LambdaSa2, site-specific recombinase, phage integrase family
ORF02072 conserved hypothetical protein

ORF02073 prophage LambdaSa2, transcriptional regulator, Cro/CI family
ORF02075 hypothetical protein
ORF02077 hypothetical protein

ORF02078 conserved hypothetical protein 

ORF02079 conserved hypothetical protein 

ORF02080 conserved hypothetical protein 
ORF02081 hypothetical protein 

ORF02084 prophage LambdaSa2, bacteriophage replication protein/hypothetical protein, truncation/fusion

ORF02085 hypothetical protein 

ORF02087 hypothetical protein 

ORF02088 conserved hypothetical protein
ORF02089 prophage LambdaSa2, HNH endonuclease family protein
ORF02090 prophage LambdaSa2, antirepressor protein, putative

ORF02091 conserved domain protein 

ORF02092 hypothetical protein

ORF02093 hypothetical protein

ORF02094 hypothetical protein

ORF02095 prophage LambdaSa2, repressor protein, putative
ORF02097 hypothetical protein

ORF02098 prophage LambdaSa2, site-specific recombinase, phage integrase family 

ORF02100 hypothetical protein 

ORF02102 hypothetical protein 

ORF02103 microcin immunity protein MccF, putative 

ORF02105 oxidoreductase, Gfo/Idh/MocA family

ORF02108 hypothetical protein 

ORF02109 Cyclic nucleotide-binding domain protein 

ORF02119 hypothetical protein 

ORF02124 hypothetical protein 

ORF02125 nitroreductase family protein 

ORF02134 bacteriocin transport accessory protein, putative

ORF02148 neuraminidase-related protein

ORF02160 2',3'-cyclic-nucleotide 2'-phosphodiesterase (cpdB)

ORF02163 conserved hypothetical protein 

ORF02171 membrane protein, putative 

ORF02172 hypothetical protein 

ORF02173 membrane protein, putative 

ORF02175 conserved hypothetical protein, truncation 

ORF02181 phosphate transport system regulatory protein PhoU, putative 

ORF02187 hypothetical protein 

ORF02190 conserved hypothetical protein

ORF02191 hypothetical protein 

ORF02194 acetyltransferase, GNAT family 

ORF02196 hypothetical protein 

ORF02198 acetyltransferase, GNAT family 

ORF02201 membrane protein, putative 

ORF02203 hypothetical protein 

ORF02205 transcriptional regulator, Cro/CI family

ORF02206 conserved hypothetical protein 

ORF02207 conserved hypothetical protein TIGR00730

ORF02208 hypothetical protein

ORF02209 site-specific recombinase, phage integrase family 

ORF02210 conserved hypothetical protein

ORF02211 conserved hypothetical protein 

ORF02212 hypothetical protein

ORF02213 hypothetical protein

ORF02214 transcriptional regulator, Cro/CI family

ORF02215 expressed protein of unknown function

ORF02216 site-specific recombinase, phage integrase family 

ORF02217 conserved hypothetical protein 

ORF02219 hypothetical protein 

ORF02221 cell wall anchor protein-related protein

ORF02223 hypothetical protein 

ORF02224 hypothetical protein 

ORF02225 hypothetical protein

ORF02226 membrane protein, putative

ORF02227 conjugal transfer protein, interruption-C
ORF02230 conserved hypothetical protein

ORF02231 conserved hypothetical protein 

ORF02232 conserved hypothetical protein 

ORF02235 hypothetical protein 

ORF02236 conserved hypothetical protein 

ORF02237 hypothetical protein 

ORF02238 hypothetical protein

ORF02239 hypothetical protein 

ORF02240 transcriptional regulator, Cro/CI family 

ORF02241 hypothetical protein 

ORF02242 transcriptional regulator, Cro/CI family 

ORF02243 FtsK/SpoIIIE family protein

ORF02244 hypothetical protein 

ORF02245 hypothetical protein 

ORF02246 cell wall surface anchor family protein 

ORF02247 transposase, ISL3 family 

ORF02250 mercuric resistance operon regulatory protein MerR (merR)

ORF02251 Mn2+/Fe2+ transporter, NRAMP family 

ORF02252 membrane protein, putative 

ORF02253 ABC transporter, ATP-binding protein 

ORF02254 conserved hypothetical protein 

ORF02255 streptomycin resistance protein

ORF02257 hypothetical protein

ORF02258 hypothetical protein

ORF02259 conserved hypothetical protein

ORF02260 acetyltransferase, GNAT family

ORF02261 membrane protein, putative

ORF02263 hypothetical protein

ORF02264 transcriptional regulator, Cro/CI family 

ORF02265 PAP2 family protein 

ORF02266 conserved hypothetical protein, FRAMESHIFT

ORF02267 conserved hypothetical protein TIGR00730 

ORF02268 protease, putative 

ORF02269 rhodanese family protein 

ORF02271 hypothetical protein 

ORF02274 conserved hypothetical protein

ORF02275 5-methyltetrahydrofolate-homocysteine methyltransferase, putative

ORF02277 conserved hypothetical protein 

ORF02279 hypothetical protein 

ORF02282 sensor histidine kinase 

ORF02283 chromosome assembly-related protein 

ORF02287 expressed protein of unknown function 

ORF02291 pathogenicity protein, putative

ORF02308 hydrolase, haloacid dehalogenase-like family 

ORF02314 conserved hypothetical protein 

ORF02317 hypothetical protein 

ORF02330 hypothetical protein 

ORF02344 site-specific recombinase, phage integrase family 

ORF02345 conserved hypothetical protein 

ORF02346 conserved hypothetical protein 

ORF02347 hypothetical protein 

ORF02349 conserved hypothetical protein 

ORF02350 hypothetical protein

ORF02351 transcriptional regulator, Cro/CI family 

ORF02352 conserved domain protein 
ORF02354 hypothetical protein

ORF02356 expressed putative secreted protein

ORF02362 sensor histidine kinase 

ORF02363 response regulator 

ORF02367 membrane protein, putative 

ORF02368 conserved hypothetical protein

ORF02379 membrane protein, putative

ORF02395 transcriptional regulator, Cro/CI family

ORF02406 membrane protein, putative 

ORF02416 diacylglycerol kinase catalytic domain protein, putative

ORF02418 hypothetical protein

ORF02422 hypothetical protein 

ORF02425 conserved hypothetical protein 

ORF03001 conserved hypothetical protein

ORF03004 conserved hypothetical protein 

ORF03005 cylX protein 

ORF03006 Tn916, hypothetical protein

ORF03007 Tn916, hypothetical protein

ORF03008 Tn916, hypothetical protein
ORF03009 Tn916, tetM leader peptide
ORF03010 Tn916, hypothetical protein 

ORF03012 prophage LambdaSa2, HNH endonuclease family protein 
ORF03013 conserved hypothetical protein 
ORF03015 conjugal transfer protein, interruption-N
ORF00035 membrane protein, putative

ORF00087 lipoprotein, putative
ORF00088 hypothetical protein 

ORF00089 hypothetical protein

ORF00123 hypothetical protein

ORF00138 hypothetical protein 

ORF00187 hypothetical protein 

ORF00188 hypothetical protein 

ORF00192 hypothetical protein
ORF00205 hypothetical protein
ORF00228 lipoprotein, putative
ORF00234 hypothetical protein

ORF00235 hypothetical protein 

ORF00238 hypothetical protein

ORF00240 transcriptional regulator, Cro/CI family
ORF00241 hypothetical protein
ORF00242 conserved hypothetical protein
ORF00243 hypothetical protein
ORF00247 hypothetical protein
ORF00249 hypothetical protein 

ORF00253 hypothetical protein

ORF00254 hypothetical protein
ORF00255 hypothetical protein 

ORF00256 hypothetical protein 

ORF00257 hypothetical protein 

ORF00258 hypothetical protein
ORF00259 hypothetical protein 
ORF00260 hypothetical protein 

ORF00272 expressed putative lipoprotein

ORF00273 hypothetical protein
ORF00274 hypothetical protein
ORF00275 hypothetical protein

ORF00276 hypothetical protein

ORF00278 membrane protein, putative 

ORF00285 lipoprotein, putative 
ORF00292 hypothetical protein 
ORF00294 expressed protein of unknown function 

ORF00308 conserved hypothetical protein
ORF00332 hypothetical protein
ORF00340 hypothetical protein 

ORF00384 hypothetical protein 

ORF00402 membrane protein, putative
ORF00408 hypothetical protein

ORF00416 hypothetical protein 

ORF00417 hypothetical protein

ORF00448 hypothetical protein 

ORF00476 hypothetical protein

ORF00489 DNA-damage-inducible protein J, putative 
ORF00490 hypothetical protein 

ORF00491 lipoprotein, putative

ORF00497 conserved domain protein 

ORF00510 bacteriocin transport accessory protein, putative 

ORF00512 hypothetical protein

ORF00527 hypothetical protein 

ORF00556 hypothetical protein 
ORF00575 hypothetical protein

ORF00599 hypothetical protein

ORF00618 hypothetical protein
ORF00620 hypothetical protein 
ORF00623 hypothetical protein 

ORF00626 prophage LambdaSa1, transcriptional regulator, Cro/CI family
ORF00628 hypothetical protein
ORF00630 hypothetical protein
ORF00632 hypothetical protein
ORF00635 hypothetical protein

ORF00636 hypothetical protein

ORF00637 hypothetical protein 

ORF00642 conserved hypothetical protein 

ORF00644 hypothetical protein

ORF00645 hypothetical protein

ORF00647 hypothetical protein

ORF00649 hypothetical protein

ORF00650 hypothetical protein

ORF00653 conserved hypothetical protein 

ORF00657 conserved hypothetical protein, truncation

ORF00661 conserved hypothetical protein

ORF00673 hypothetical protein
ORF00674 hypothetical protein
ORF00675 conserved hypothetical protein

ORF00676 conserved hypothetical protein
ORF00682 hypothetical protein
ORF00685 conserved hypothetical protein
ORF00698 hypothetical protein
ORF00712 hypothetical protein

ORF00718 cell wall surface protein, interruption-N
ORF00723 hypothetical protein

ORF00735 expressed protein of unknown function
ORF00737 conserved hypothetical protein, degenerate
ORF00738 hypothetical protein
ORF00740 hypothetical protein
ORF00741 hypothetical protein 
ORF00747 cylD protein (cylD) 

ORF00753 cylE protein (cylE) 

ORF00756 cylJ protein (cylJ) 

ORF00757 cylK protein (cylK)

ORF00758 hypothetical protein 
ORF00759 putative secreted protein
ORF00761 hypothetical protein 

ORF00796 hydrolase, haloacid dehalogenase-like family 
ORF00806 conserved hypothetical protein
ORF00822 ABC transporter, ATP-binding protein 
ORF00827 hypothetical protein 
ORF00872 cell wall surface anchor family protein 
ORF00909 hypothetical protein
ORF00923 hypothetical protein 
ORF00924 conserved hypothetical protein 
ORF00942 expressed putative secreted protein 
ORF00943 hypothetical protein 
ORF00944 hypothetical protein 
ORF01013 hypothetical protein
ORF01014 hypothetical protein
ORF01015 hypothetical protein
ORF01016 hypothetical protein
ORF01018 hypothetical protein
ORF01019 hypothetical protein 
ORF01021 hypothetical protein 
ORF01035 Tn916, excisionase
ORF01062 hypothetical protein
ORF01096 nisin-resistance protein, putative

ORF01145 lipoprotein, putative
ORF01146 conserved hypothetical protein, FRAMESHIFT
ORF01148 lipoprotein, putative
ORF01149 hypothetical protein

ORF01150 hypothetical protein

ORF01151 hypothetical protein

ORF01152 lipoprotein, putative

ORF01153 hypothetical protein

ORF01158 hypothetical protein

ORF01159 hypothetical protein

ORF01161 expressed conserved domain protein 

ORF01162 conserved hypothetical protein

ORF01166 hypothetical protein

ORF01168 conserved hypothetical protein

ORF01169 hypothetical protein

ORF01174 conserved domain protein
ORF01175 hypothetical protein 

ORF01186 cell wall surface anchor family protein, putative

ORF01187 hypothetical protein

ORF01204 hypothetical protein

ORF01215 hypothetical protein
ORF01258 hypothetical protein

ORF01262 conserved hypothetical protein, FRAMESHIFT 
ORF01263 hypothetical protein 

ORF01265 hypothetical protein

ORF01266 hypothetical protein 

ORF01304 polysaccharide biosynthesis protein CpsK(V) (cpsK) 

ORF01308 polysaccharide biosynthesis protein CpsM(V) (cpsM)
ORF01309 polysaccharide biosynthesis protein cpsH(V) (cpsH)

ORF01349 hypothetical protein

ORF01384 hypothetical protein 

ORF01385 hypothetical protein

ORF01386 conserved hypothetical protein

ORF01392 hypothetical protein

ORF01395 conserved hypothetical protein

ORF01409 conserved hypothetical protein
ORF01410 hypothetical protein
ORF01417 hypothetical protein 
ORF01418 hypothetical protein 
ORF01420 hypothetical protein 
ORF01423 conserved hypothetical protein 
ORF01424 conserved hypothetical protein 

ORF01425 conserved hypothetical protein
ORF01426 conserved hypothetical protein
ORF01427 hypothetical protein

ORF01431 hypothetical protein
ORF01432 conserved domain protein

ORF01434 hypothetical protein 

ORF01435 calcium-binding protein, putative

ORF01437 abortive infection protein AbiGI (abiGI)

ORF01438 abortive infection protein AbiGII (abiGII)

ORF01441 conserved hypothetical protein, degenerate 

ORF01443 hypothetical protein 

ORF01445 hypothetical protein 

ORF01452 hypothetical protein 

ORF01459 hypothetical protein 

ORF01463 hypothetical protein 

ORF01464 hypothetical protein 

ORF01465 hypothetical protein 

ORF01486 hypothetical protein 

ORF01488 R5 protein 

ORF01575 membrane protein, putative 

ORF01581 lipoprotein, putative 

ORF01601 hypothetical protein

ORF01611 hypothetical protein 

ORF01638 conserved hypothetical protein 

ORF01645 cell wall surface anchor family protein

ORF01660 membrane protein, putative 

ORF01666 hypothetical protein 

ORF01667 hypothetical protein

ORF01670 hypothetical protein 

ORF01673 hypothetical protein 

ORF01674 hypothetical protein 

ORF01675 hypothetical protein 

ORF01681 hypothetical protein 

ORF01682 hypothetical protein 

ORF01684 hypothetical protein 

ORF01719 hypothetical protein 

ORF01733 hypothetical protein 

ORF01735 hypothetical protein

ORF01736 hypothetical protein 

ORF01737 hypothetical protein 

ORF01742 hypothetical protein 

ORF01754 hypothetical protein 

ORF01761 transposase, IS30 family, putative, truncation 

ORF01807 hypothetical protein

ORF01836 hypothetical protein 

ORF01838 hypothetical protein 

ORF01842 hypothetical protein 

ORF01850 hypothetical protein 

ORF01892 hypothetical protein 

ORF01893 hypothetical protein 

ORF01895 hypothetical protein 

ORF01896 hypothetical protein

ORF01897 hypothetical protein

ORF01898 hypothetical protein

ORF01899 hypothetical protein 

ORF01905 hypothetical protein

ORF01934 hypothetical protein

ORF01943 hypothetical protein 

ORF01944 hypothetical protein 
ORF01953 hypothetical protein

ORF01984 hypothetical protein

ORF01988 hypothetical protein

ORF01989 hypothetical protein 

ORF02005 hypothetical protein 

ORF02011 hypothetical protein

ORF02016 hypothetical protein 

ORF02025 hypothetical protein 

ORF02026 hypothetical protein

ORF02045 conserved hypothetical protein 

ORF02047 prophage LambdaSa2, holin, putative

ORF02048 conserved hypothetical protein 

ORF02049 hypothetical protein 

ORF02050 conserved domain protein

ORF02053 conserved hypothetical protein

ORF02057 hypothetical protein

ORF02058 hypothetical protein

ORF02061 hypothetical protein 

ORF02062 hypothetical protein 

ORF02063 conserved domain protein

ORF02067 conserved hypothetical protein

ORF02069 hypothetical protein

ORF02070 hypothetical protein

ORF02072 conserved hypothetical protein

ORF02073 prophage LambdaSa2, transcriptional regulator, Cro/CI family

ORF02075 hypothetical protein 

ORF02077 hypothetical protein

ORF02078 conserved hypothetical protein

ORF02081 hypothetical protein

ORF02085 hypothetical protein

ORF02087 hypothetical protein

ORF02088 conserved hypothetical protein

ORF02091 conserved domain protein

ORF02092 hypothetical protein

ORF02093 hypothetical protein

ORF02094 hypothetical protein 

ORF02097 hypothetical protein 

ORF02100 hypothetical protein

ORF02102 hypothetical protein 

ORF02108 hypothetical protein 

ORF02119 hypothetical protein 

ORF02124 hypothetical protein

ORF02171 membrane protein, putative

ORF02172 hypothetical protein

ORF02173 membrane protein, putative

ORF02191 hypothetical protein

ORF02196 hypothetical protein

ORF02203 hypothetical protein

ORF02208 hypothetical protein

ORF02212 hypothetical protein 

ORF02213 hypothetical protein

ORF02214 transcriptional regulator, Cro/CI family 

ORF02215 expressed protein of unknown function 

ORF02217 conserved hypothetical protein 
ORF02219 hypothetical protein
ORF02221 cell wall anchor protein-related protein

ORF02223 hypothetical protein

ORF02224 hypothetical protein

ORF02225 hypothetical protein

ORF02231 conserved hypothetical protein 

ORF02235 hypothetical protein 

ORF02236 conserved hypothetical protein 

ORF02237 hypothetical protein

ORF02238 hypothetical protein 

ORF02239 hypothetical protein 

ORF02241 hypothetical protein 

ORF02244 hypothetical protein 

ORF02245 hypothetical protein 

ORF02263 hypothetical protein 

ORF02268 protease, putative 

ORF02271 hypothetical protein 

ORF02279 hypothetical protein

ORF02283 chromosome assembly-related protein 

ORF02317 hypothetical protein 

ORF02330 hypothetical protein 

ORF02344 site-specific recombinase, phage integrase family
ORF02345 conserved hypothetical protein

ORF02347 hypothetical protein

ORF02349 conserved hypothetical protein

ORF02350 hypothetical protein

ORF02351 transcriptional regulator, Cro/CI family

ORF02354 hypothetical protein

ORF02356 expressed putative secreted protein 

ORF02395 transcriptional regulator, Cro/CI family

ORF02418 hypothetical protein 

ORF02422 hypothetical protein

ORF02425 conserved hypothetical protein 

ORF03004 conserved hypothetical protein 

ORF03005 cylX protein

ORF03006 Tn916, hypothetical protein 

ORF03007 Tn916, hypothetical protein 

ORF03008 Tn916, hypothetical protein

ORF03009 Tn916, tetM leader peptide

ORF03010 Tn916, hypothetical protein
ORF03015 conjugal transfer protein, interruption-N
Table 13: Comparative Sequences relating to SAG0466 (thiolase)



SEQ ID NO. 1301: SAG0466 FROM THE 2603V/R GBS STRAIN

CTCCTGCCCCTGCAATGGCAGTTAGACCCATAGGTTTATTTTTATATTTTAATGCCTGCATAAGATGAAGGATATTAATA 
ATTCCTGAGCAGGCATAAGGGTGTCCGTAAGCTAATGTCCCTCCAAAAATATTGAATTTTTCTCTCTCTTCAGGATAATA 
ATGATTAAATAGAGCATCAATCGCTGCAAATGGTTCATTCCATTCAATTGCATCATAATCCGATATTTTAGTATGAGTTT 
CTGTTAATAGTTTTTCGGTAGCCGTGTGAACCAATTCTGGACTAAGCTTGGGATCTCCTGCTACTTCTACAATGTGAACA 
ATCCGGAATTCTGTTTTCTGACTCTGAAGCGTTAGAAATGCAGCAGCATCGTGCATTAAACAAACATTTCCAATAGTGAG 
CAAAGGTGAATTTTCCATCAATCTTGGTAATTTTTGAAAAAATGTTtCTTTTaGTTTTCTAACGCCTTGATCTCGCATCC 
CTTCCATTGGTAAGATTACyTCTTCTAAATAGCCACCTTGTTTAGCTGTTAAGGCGCGTTTATGGCTCAAGAATGCCAAT 
TTATCTAACATTTCTCTTCTAAAaCCATATTTTTGACAGACTCTCTGGGCCCCTTCTAACATTACAGTTTCAGCATAAGA 
GTCAGGAGAAAACTGAGCAACTGTATATTCTCCGTTACGATTATCTTCTTTAGCATAACGTCTCATAGGTTGAAGAGAAC 
TACTTTCAATCCCCCCAACAAGAACTTTTTCATTAATACCGGTACTGATTTTTAGATAACCAAAAAACAAGGCAGAACTT 
GATGAAGCACACTGCATATCAATCGTTTGTACTGGAATATAGGATTCATAATCAGAAAAAAGAGTCATCAAACGACCAAT 

AAAGGTGTGCTCCTAAAAGTTCTGGACGGTAAGTTTAAATTGCTT 

SEQ ID NO. 1302: SAG0466 FROM THE M732 GBS TYPE III STRAIN

TCGGTATAAAAGGGAAGCAATTTAAACATTACCGTCCAGAACTTTTAGGAGCACACCTCTTAAATCAAATAAAAAAAATA 
GAATCAGAATCTAATATTGATAGTATTATTTGTGGGAACACAGTTGGTACTGGGGGCAATATTGGTCGTTTGATGACTCT 
TTTTTCTGATTATGAATCCTATATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGTTCTGCCTTGTTTTTTG 
GTTATCTAAAAATCAGTGCCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAAAGTAGTTCTCTTCAACCTATGAGA
CGTTACGCTAAAGAAGATAATCGTAACGGAGAATATACCGTTGCTCAGTTTTCTCCTGACTCTTATGCTGAAACTGTAAT 
GTTAGAAGGGGCACAAAGAGTCTGTCAAAAATATGGTTTTAGAAGAGAAATGTTAGATAAATTGGCATTCTTGAGCCATA 
AACGCGCCTTAACAGCTAAACAAGGTGGCTATTTAGAAGAGGTAATCTTACCAATGGAAGGGATGCGAGATCAAGGCGTT 
AGAAAACTAAAAGAAGCATTTTTTCAAAAATTACCAAGATTGATGGAAAATTCACCTTTGCTCACTATTGGAAATGTTTG 
TTTAATGCACGATGCTGCTGCATTTCTAACGCTTCAGAGTCAGAAAACAGAATTCCGGATTGTTCACATTGTAGAAGTAG 
CAGGAGATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGGAAAAACTATTAACAGAAACTCATACTAAAATATCG
GATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGATGCTTTATTTAATCATTATTATCCTGAAGAGAGAGA 
AAAATTCAATATTTTTGGAGGGGCATTAGCTTACGGACACCCTTATGCCTGCTCAGGAATTA 

SEQ ID NO. 1303: SAG0466 FROM THE 090 GBS TYPE Ia STRAIN

TTGTGGGAACACAGTTGGTACTGGGGGCAATATTGGTCGTTTGATGACTCTTTTTTCTGATTATGAATCCTATATTCCAG 
TACAAACGATTGATATGCAGTGTGCTTCATCAAGTTCTGCCTTGTTTTTTGGTTATCTAAAAATCAGTGCCGGTATTAAT 
GAAAAAGTTCTTGTTGGGGGGATTGAAAGTAGTTCTCTTCAACCTATGAGACGTTACGCTAAAGAAGATAATCGTAACGG 
AGAATATACCGTTGCTCAGTTTTCTCCTGACTCTTAkGCTGAAACTGTAATGtTAGAAGGGGCACAAAGAGTCTGTCAAA 
AATATGGTTTtAGAAGAGAAATGTTAGATAAATTGGCATTCTTGAGCCATAAACGCGCCTTAACAGCTAAACAAGGTGGC 
TATTTAGAAGAGGTAATCTTACCAATGGAAGGGATGCGAGATCAAGGCGTTAGAAAACTAAAAGAAGCATTTTTTCAAAA 
ATTACCAAGATTGATGGrAAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTCTwA 
CGCTTCAGAGTCAGAAAACAGAATTCCGGATTGTTCACATTGTAGAAGTAGCAGGAGATCCGAAGCTTAGTCCAGAATTG 
GTTCACACGGCTACGGAAAAACTATTAACAGAAACTCATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAAcc
ATTTGCAGCGATTGATGCTTTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGGCATTAG 
CTTACGGACACCCTTATGCCTGCTCAGG 

SEQ ID NO. 1304: SAG0466 FROM THE COH1 GBS TYPE Ia STRAIN

ATCGGTATAAAAGGGAAGCAATTTAAAATTACCGTCCAGAACTTTTAGGAGCACACCTCTTAAATCAAATAAAAAAAATA
GAATCAGAATCTAATATTGATAGTATTATTTGTGGGAACACAGTTGGTACTGGGGGCAATATTGGTCGTTTGATGACTCT
TTTTTCTGATTATGAATCCTATATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGTTCTGCCTTGTTTTTTG 
GGTATCTAAAAA 

SEQ ID NO. 1305: SAG0466 FROM THE CJB GBS NONTYPEABLE STRAIN (REVERSE COMPLEMENT)

TTTTCAAAAATTACCAAGATTGATGGAAAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTG 
CATTTCTAACGCTTCAGAGTCAGAAAACAGAATTCCGGATTGTTCACATTGTAGAAGTAGCAGGAGATCCCAAGCTTAGT 
CCAGAATTGGTTCACACGGCTACGGAAAAACTATTAACAGAAACTCATACTAAAATATCGGATTATGATGCAATTGAATG 
GAATGAACCATTTGCAGCGATTGATGCTTTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAG 
GGGCATTAGCTTACGGACACCCTTAATGCCTGCTCAGGAATTATTAATATCC 

SEQ ID NO. 1306: SAG0466 FROM THE CJB110 GBS NONTYPEABLE STRAIN

GGTATAAAAGGGAAGCAATTTAAACATTACCGTCCAGAACTTTTAGGAGCACACCTCTTAAATCAAATAAAAAAAATATA 
ACCAGAATCTAACATTGATAGTATTATTTGTGGGAACACAGTTGGTACTGGGGGCAATATTGGTCGTTTGATGACTCTTT 
TTTCTGATTATGAATCCTATATTC 

SEQ ID NO. 1307: SAG0466 FROM THE 1169NT1 GBS TYPE V STRAIN (REVERSE COMPLEMENT)

CAAGATTGATGGAAAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTCTAACGCTT 
CAGAGTCAGAAAACAGAATTCCGGATTGTTCACATTGTAGAAGTAGCAGGAGATCCCAAGCTTAGTCCAGAATTGGTTCA
CACGGCTACGGAAAAACTATTAACAGAAACTCATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTG 
CAGCGATTGATGCTTTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGGCATTAGCTTAC 
GGACACCCTTATGCCTGCTCAGGAATTATTAATATCCTTCATCTTATGCAGGCATTAAAATATAAAAATAAACCTATGGG
CCTAACTGCCATTGCAGGGGCA 

SEQ ID NO. 1308: SAG0466 FROM THE 18RS21 GBS TYPE II STRAIN 

CCTTAACAGTTAAACAAGGTGGCTATTTAGAAGAGGTAATCTTACCAATGGAAGGGATGCGAGATCAAGGCGTTAGAAAA 
CTAAAAGAAACATTTTTTCAAAAATTACCAAGATTGATGGAAAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAAT 
GCACGATGCTGCTGCATTTCTAACGCTTCAGAGTCAGAAAACAGAATTCCGGATTGTTCACATTGTAGAAGTAGCAGGAG
ATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGGAAAAACTATTAACAGAAACTCATACTAAAATATCGGATTAT
GATGCAATTGAATGGAATGAACCATTTGCAGCGATTGATGCTCTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATT
CAATATTTTTGGAGGGACATTAGCTTACGGACACCCTTATGCCTGCTCAGGAATTATTAATATCCTTCATCTTATGCAGG 
CATTAAAATATAAAAATAAACCTATGGGTCTAACTGCCATTGCAGGGGCAG 

SEQ ID NO. 1309: SAG0466 FROM THE 18RS21 GBS TYPE II STRAIN 

TCGGTATAAAAGGGAAGCAATTTAAACATTACCGTCCAGAACTTTTAGGAGCACACCTTTTAAATCAAATAAAAAAAATA 
GAATCAGAATCTAACATTGATAGTATTATTTGTGGGAACACAGTTGGTACTGGGGGCAATATTGGTCGTTTGATGACTCT 
TTTTTCTGATTATGAATCCTATATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGTTCTGCCTTGTTTTTTG 
GTTATCTAAAAATCAGTACCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAAAGTAGTTCTCTTCAACCTATGAGA 
CGTTATGCTAAAGAAGATAATCGTAACGGAGAATATACAGTTGCTCAGTTTTCTCCTGACTCTTATGCTGAAACTGTAAT 
GTTAGAAGGGGCCCAGAGAGTCTGTCAAAAATATGGTTTTAGAAGAGAAATGTTAGATAAATTGGCATTCTTGAGCCATA 
AACGCGCCTTAACAGCTAAACA 

SEQ ID NO. 1310: SAG0466 FROM THE H36b GBS TYPE Ib STRAIN

TTTGGGCTACGAACACCTATCGGTATAAAAGGGAAGCAATTTAAACATTACCGTCCAGAACTTTTAGGAGCACACCTTTT
AAATCAAATAAAAAAAATAGAATCAGAATCTAACATTGATAGTATTATTTGTGGGAACACAGTTGGTACTGGGGGCAATA 
TTGGTCGTTTGATGACTCTTTTTTCTGATTATGAATCCTATATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCA
AGTTCTGCCTTGTTTTTTGGTTATCTAAAAATCAGTACCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAAAGTAG 
TTCTCTTCAACCTATGAGACGTTATGCTAAAGAAGATAATCGTAACGGAGAATATACAGTTGCTCAGTTTTCTCCTGACT 
CTTATGCTGAAACTGTAATGTTAGAAGGGGCCC

SEQ ID NO. 1311: SAG0466 FROM THE H36b GBS TYPE Ib STRAIN (REVERSE COMPLEMENT)

GAAAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTCTAACGCTTCAGAGTCAGAA 
AACAGAATTCCGGATTGTTCACATTGTAGAAGTAGCAGGAGATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGG 
AAAAACTATTAACAGAAACTCATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGAT 
GCTCTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGACATTAGCTTACGGACACCCTTA
TGCCTGCTCAGGAATTATTAATATCCTTCATCTTATGCAGGCATTAAAATATAAAAATAAACCTATGGGTCTAACTGCCA
TTGCAGGGGCAGGA 

SEQ ID NO. 1312: SAG0466 FROM THE M781 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

CCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTCTAACGCTTCAGAGTCAGAAAACAGAATT 
CCGGATTGTTCACATTGTAGAAGTAGCAGGAGATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGGAAAAACTAT
TAACAGAAACTCATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGATGCTTTATTT 
AATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGGCATTAGCTTACGGACACCCTTATGCCTGCTC 
AGGAATTATTAATATCCTTCATCTTATGCAGGCATTAAAATATAAAAATAAACCTATGGGTTCTAAGTGC

SEQ ID NO. 1313: SAG0466 FROM THE M781 GBS TYPE III STRAIN 

GCAATTTAAACATTACCGTCCAGAACTTTTAGGAGCACACCTCTTAAATCAAATAAAAAAAATAGAATCAGAATCTAATA
TTGATAGTATTATTTGTGGGAACACAGTTGGTACTGGGGGCAATATTGGTCGTTTGATGACTCTTTTTTCTGATTATGAA 
TCCTATATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGTTCTGCCTTGTTTTTTGGTTATCTAAAAATCAG 
TGCCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAAAGTAGTTCTCTTCAACCTATGAGACGTTACGCTAAAGAAG 
ATAATCGTAACGGAGAATATACCGTTGCTCAGTTTTCTCCTGACTCTTATGCTGAAACTGTAATGTTAGA

SEQ ID NO. 1314: SAG0466 FROM THE A909 GBS TYPE Ia STRAIN (REVERSE COMPLEMENT)

CCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTCTAACGCTTCAGAGTCAGAAAACAGAATT 
CCGGATTGTTCACATTGTAGAAGTAGCAGGAGATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGGAAAAACTAT 
TAACAGAAACTCATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGATGCTCTATTT 
AATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGACATTAGCTTACGGACACCCTTATGCCTGCTC
AGGAATTATTAATATCCTTCATCTTATGCAGGCATTAAAATATAAAAATAAACCTATGGGTCTAACTGCCATTGCAGGGG 
C 

SEQ ID NO. 1315: SAG0466 FROM THE JM9130013 GBS TYPE VIII STRAIN (REVERSE COMPLEMENT)

GCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTCTAACGCTTCAGAGTCAGAAAACAGAATTCCGGA 
TTGTTCACATTGTAGAAGTAGCAGGAGATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGGAAAAACTATTAACA 
GAAACTCATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGATGCTCTATTTAATCA 
TTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGGCATTAGCTTACGGACACCCTTATGCCTGCTCAGGAA
TTATTAATATCCTTCATCTTATGCAGGCATTAAAATATAAAAATAAACCTATGGGTCTAACTGCCATTGCAGGGGCAGGA 

SEQ ID NO. 1316: SAG0466 FROM THE JM9130013 GBS TYPE VIII STRAIN 

TTTGGGCTACGAACACCTATCGGTATAAAAGGGAAGCAATTTAAACATTACCGTCCAGAACTTTTAGGAGCACACCTTTT
AAATCAAATAAAAAAAATAGAATCAGAATCTAACATTGATAGTATTATTTGTGGGAACACAGTTGGTACTGGGGGCAATA 
TTGGTCGTTTGATGACTCTTTTTTCTGATTATGAATCCTATATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCA 
AGTTCTGCCTTGTTTTTTGGTTATCTAAAAATCAGTACCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAAAGTAG 
TTCTCTTCAACCTATGAGACGTTATGCTAAAGAAGATAATCGTAACGGAGAATATA

SEQ1301 CTCCTGCCCCTGCAATGGCAGTTAGACCCATAGGTTTATTTTTATATTTTA

SEQ1302

SEQ1303 

SEQ1304 

SEQ1305 

SEQ1306 

SEQ1307

SEQ1308 CTTAACAGTTAAACAAGGTGGCTATTTAGAAGAGGTAATCTTACCAATGGAAGGGATGC

SEQ1309 

SEQ1310 

SEQ1311 

SEQ1312 

SEQ1313

SEQ1314 

SEQ1315 

SEQ1316 



SEQ1301 TGCCTGCATAAGATGAAGGATATTAATAATTCCTGAGCAGGCATAAGGGTGTCCGTAAG

SEQ1302 TCGGTATAAA

SEQ1303

SEQ1304 ATCGGTATAAA

SEQ1305 TTTTCAAAAATTACCAAGATTGATGG

SEQ1306 GGTATAAA

SEQ1307 CAAGATTGATGG

SEQ1308 AGATCAAGGCGTTAGAAAACTAAAAGAAACATTTTTTCAAAAATTACCAAGATTGATGG

SEQ1309 TCGGTATAAA

SEQ1310 TTTGGGCTACGAACACCTATCGGTATAAA

SEQ1311 G

SEQ1312

SEQ1313

SEQ1314

SEQ1315

SEQ1316 TTTGGGCTACGAACACCTATCGGTATAAA

SEQ1301 TAATGTCCCTCCAAA-AATATTGAATTTTTCTCTCTC-TTCAGGATAATAATGATTAAA

SEQ1302 GGGAAGCAATTTAAACATTACCGTCCAGAACTTTTAGGAGCACACCTCTTAAATCAAAT

SEQ1303

SEQ1304 GGGAAGCAATTTAAA-ATTACCGTCCAGAACTTTTAGGAGCACACCTCTTAAATCAAAT

SEQ1305 AAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTC

SEQ1306 GGGAAGCAATTTAAACATTACCGTCCAGAACTTTTAGGAGCACACCTCTTAAATCAAAT

SEQ1307 AAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTC

SEQ1308 AAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTC

SEQ1309 GGGAAGCAATTTAAACATTACCGTCCAGAACTTTTAGGAGCACACCTTTTAAATCAAAT

SEQ1310 GGGAAGCAATTTAAACATTACCGTCCAGAACTTTTAGGAGCACACCTTTTAAATCAAAT

SEQ1311 AAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTC

SEQ1312 CCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTC

SEQ1313 GCAATTTAAACATTACCGTCCAGAACTTTTAGGAGCACACCTCTTAAATCAAAT

SEQ1314 CCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTC

SEQ1315 GCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTC

SEQ1316 GGGAAGCAATTTAAACATTACCGTCCAGAACTTTTAGGAGCACACCTTTTAAATCAAAT



SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306
SEQ1307 
SEQ1308
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 



AGAGCATCAATCGCTGCAAATGGTTCATTCC-ATTCAATTGCATCATAATCCGATATTT
AAAAAAATAGAATCAGAATCTAAT--ATT---GATAGTATTATTTGTGGGAACA-CAGT
TTGTGGGAACA-CAGT
AAAAAAATAGAATCAGAATCTAAT--ATT---GATAGTATTATTTGTGGGAACA-CAGT
AACGCTTCAGAGTCAGAAAACAGA--ATTCCGGATTGTTCACATTGTAGAAGTAGCAGG
AAAAAAATATAACCAGAATCTAAC--ATT---GATAGTATTATTTGTGGGAACA-CAGT
AACGCTTCAGAGTCAGAAAACAGA--ATTCCGGATTGTTCACATTGTAGAAGTAGCAGG
AACGCTTCAGAGTCAGAAAACAGA--ATTCCGGATTGTTCACATTGTAGAAGTAGCAGG
AAAAAAATAGAATCAGAATCTAAC--ATT---GATAGTATTATTTGTGGGAACA-CAGT
AAAAAAATAGAATCAGAATCTAAC--ATT---GATAGTATTATTTGTGGGAACA-CAGT
AACGCTTCAGAGTCAGAAAACAGA--ATTCCGGATTGTTCACATTGTAGAAGTAGCAGG
AACGCTTCAGAGTCAGAAAACAGA--ATTCCGGATTGTTCACATTGTAGAAGTAGCAGG
AAAAAAATAGAATCAGAATCTAAT--ATT---GATAGTATTATTTGTGGGAACA-CAGT
AACGCTTCAGAGTCAGAAAACAGA--ATTCCGGATTGTTCACATTGTAGAAGTAGCAGG
AACGCTTCAGAGTCAGAAAACAGA--ATTCCGGATTGTTCACATTGTAGAAGTAGCAGG
AAAAAAATAGAATCAGAATCTAAC--ATT---GATAGTATTATTTGTGGGAACA-CAGT



AGTATGAGTTTCTGTTAATAGTTTTTCCGTAGCCGTGTGAACCAATTCTGGACTAAGCT
GGTACTGGGGGCAATATTGG-TCGTTTGATGACTCTTTTTTCTGATTATGAATCCTA--
GGTACTGGGGGCAATATTGG-TCGTTTGATGACTCTTTTTTCTGATTATGAATCCTA--
GGTACTGGGGGCAATATTGG-TCGTTTGATGACTCTTTTTTCTGATTATGAATCCTA--
GATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGGAAAAACTATTAACAGAAAC
GGTACTGGGGGCAATATTGG-TCGTTTGATGACTCTTTTTTCTGATTATGAATCCTA--
GATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGGAAAAACTATTAACAGAAAC
GATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGGAAAAACTATTAACAGAAAC
GGTACTGGGGGCAATATTGG-TCGTTTGATGACTCTTTTTTCTGATTATGAATCCTA--
GGTACTGGGGGCAATATTGG-TCGTTTGATGACTCTTTTTTCTGATTATGAATCCTA--
GATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGGAAAAACTATTAACAGAAAC
GATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGGAAAAACTATTAACAGAAAC
GGTACTGGGGGCAATATTGG-TCGTTTGATGACTCTTTTTTCTGATTATGAATCCTA--
GATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGGAAAAACTATTAACAGAAAC
GATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGGAAAAACTATTAACAGAAAC
GGTACTGGGGGCAATATTGG-TCGTTTGATGACTCTTTTTTCTGATTATGAATCCTA--

GGGATCTCCTGCTACTTCTACAATGTGAACAATCCGGA-ATTCTGTTTTCTGACTCTGA 
TATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGT--TCTGCCTTGTTTTT
TATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGT--TCTGCCTTGTTTTT
TATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGT--TCTGCCTTGTTTTT
CATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGA

TATTC

CATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGA
CATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGA
TATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGT--TCTGCCTTGTTTTT
TATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGT--TCTGCCTTGTTTTT
CATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGA
CATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGA
TATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGT--TCTGCCTTGTTTTT
CATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGA
CATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGA
TATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGT--TCTGCCTTGTTTTT



SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 



GCGTTAGAAATGCAGCAGCATCGTGCATTAAACAAACATTTC--CAATAGTGAGCAAAG
GGT-TATCTAAAAATCAGTG-CCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAA
GGT-TATCTAAAAATCAGTG-CCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAA

GGG-TATCTAAAAA

GCTTTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGGC

GCTTTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGGC
GCTCTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGAC
GGT-TATCTAAAAATCAGTA-CCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAA
GGT-TATCTAAAAATCAGTA-CCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAA
GCTCTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGAC
GCTTTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGGC
GGT-TATCTAAAAATCAGTG-CCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAA
GCTCTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGAC
GCTCTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGGC
GGT-TATCTAAAAATCAGTA-CCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAA

TGAATTTTCCATCAATCTTGG--TAATTTTTGAAAAAATGTTTCTTTTAGTTTTCTAAC 
GTAGTTCTCTTCAACCTATGAGACGTTACGCTAAAGAAGATAATCGTAACGGAGAATAT 
GTAGTTCTCTTCAACCTATGAGACGTTACGCTAAAGAAGATAATCGTAACGGAGAATAT 



TTAGCTTACGGACACCCTTAA--TGCCTGCTCAGGAATTATTAATATCC

TTAGCTTACGGACACCCTTA---TGCCTGCTCAGGAATTATTAATATCCTTCATCTTAT

TTAGCTTACGGACACCCTTA---TGCCTGCTCAGGAATTATTAATATCCTTCATCTTAT
GTAGTTCTCTTCAACCTATGAGACGTTATGCTAAAGAAGATAATCGTAACGGAGAATAT
GTAGTTCTCTTCAACCTATGAGACGTTATGCTAAAGAAGATAATCGTAACGGAGAATAT
TTAGCTTACGGACACCCTTA---TGCCTGCTCAGGAATTATTAATATCCTTCATCTTAT
TTAGCTTACGGACACCCTTA---TGCCTGCTCAGGAATTATTAATATCCTTCATCTTAT
GTAGTTCTCTTCAACCTATGAGACGTTACGCTAAAGAAGATAATCGTAACGGAGAATAT
TTAGCTTACGGACACCCTTA---TGCCTGCTCAGGAATTATTAATATCCTTCATCTTAT
TTAGCTTACGGACACCCTTA---TGCCTGCTCAGGAATTATTAATATCCTTCATCTTAT
GTAGTTCTCTTCAACCTATGAGACGTTATGCTAAAGAAGATAATCGTAACGGAGAATAT

CCTTGATCTCGCATCCCTTCCATTGGTAAGATTACYTCTTCTAAATAGCCACCTTGTTT 
CCGTTGCTCAGTTTTCTCCTGACTCTTATGCTG--AAACTGTAATGTTAGAAGGGGCAC
CCGTTGCTCAGTTTTCTCCTGACTCTTAKGCTG--AAACTGTAATGTTAGAAGGGGCAC

CAGGCATTAAAATATAAAAATAAACCTATGGGC-CTAACTGCCATTGCAGGGGCA

CAGGCATTAAAATATAAAAATAAACCTATGGGT-CTAACTGCCATTGCAGGGGCAG

CAGTTGCTCAGTTTTCTCCTGACTCTTATGCTG--AAACTGTAATGTTAGAAGGGGCCC
CAGTTGCTCAGTTTTCTCCTGACTCTTATGCTG--AAACTGTAATGTTAGAAGGGGCCC
CAGGCATTAAAATATAAAAATAAACCTATGGGT-CTAACTGCCATTGCAGGGGCAGGA-

CAGGCATTAAAATATAAAAATAAACCTATGGGTTCTAACTGC

CCGTTGCTCAGTTTTCTCCTGACTCTTATGCTG--AAACTGTAATGTTAGA

CAGGCATTAAAATATAAAAATAAACCTATGGGT-CTAACTGCCATTGCAGGGGC

CAGGCATTAAAATATAAAAATAAACCTATGGGT-CTAACTGCCATTGCAGGGGCAGGA-
TABCMARATVSTNCSRATNGTSAGTHAS 



SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 



GCTGTTAAGGCGCGTTTATGGCTCAAGAATGCCAATTTATCTAACATTTCTCTTCTAAA 
AAGAGTCTGTCAAAAATATGGTTTTAGAAGAGAAATGTTAGATAAATTGGCATTCTTGA 
AAGAGTCTGTCAAAAATATGGTTTTAGAAGAGAAATGTTAGATAAATTGGCATTCTTGA 



GAGAGTCTGTCAAAAATATGGTTTTAGAAGAGAAATGTTAGATAAATTGGCATTCTTGA 



CCATATTTTTGACAGACTCTCTGGGCCCCTT - - CTAACATTACAGTTTCAGCATAAGAG 
CCATAAACGCGCCTTAACAGCTAAACAAGGTGGCTATTTAGAAGAGGTAATCTTACCAA 
CCATAAACGCGCCTTAACAGCTAAACAAGGTGGCTATTTAGAAGAGGTAATCTTACCAA 



CCATAAACGCGCCTTAACAGCTAAACA- 



CAGGAGAAAACTGAGCAACTGTATATTCTCCGTTACGATTATCTTCTTTAGCATAACGT 
GGAAGGGATGCGAGATCAAGGCGTTAGAAAACTAAAAGAAGCATTTTTTCAAAAATTAC 
GGAAGGGATGCGAGATCAAGGCGTTAGAAAACTAAAAGAAGCATTTTTTCAAAAATTAC 



SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 



TCATAGGTTGAAGAGAACTACTTTCAATCCCCCCAACAAGAACTTTTTCATTAATACCG
AAGATTGATGGAAAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATG 
AAGATTGATGGRAAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATG 



TACTGATTTTTAGATAACCAAAAAAC--AAGGCAGAACTTGATGAAGCACACTGCATAT
TGCTGCATTTCTAACGCTTCAGAGTCAGAAAACAGAATTCCGGATTGTTCACATTGTAG 
TGCTGCATTTCTWACGCTTCAGAGTCAGAAAACAGAATTCCGGATTGTTCACATTGTAG 



AATCGTTTGTACTGGAATATAGGATTCATAATCAGAAAAAAGAGTCATCAAACGACCAA 
AGTAGCAGGAGATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGGAAAAACTAT
AGTAGCAGGAGATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGGAAAAACTAT



SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306
SEQ1307 
SEQ1308
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 



ATTGCCCCCAGTACCAACTGTGTTCCCACAAATAATACTATCAATGTTAGATTCTGATT
AACAGAAACTCATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTG 
AACAGAAACTCATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTG 

TATTTTTTTTATTTGATTTAAAAGGTGTGCTCCTAAAAGTTCTGGACGGTAAGTTTAAA 
AGCGATTGATGCTTTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTT
AGCGATTGATGCTTTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTT 

TGCTT 

TGGAGGGGCATTAGCTTACGGACACCCTTATGCCTGCTCAGGAATTA 
TGGAGGGGCATTAGCTTACGGACACCCTTATGCCTGCTCAGG
Table 14: Comparative Sequences relating to SAG0471 (glucokinase)



SEQ ID NO. 1401: SAG0471 FROM THE 18RS21 GBS TYPE II STRAIN 

TTAAATTTGGTATCTTGACGCTTGAGGGAGAAGTAC^GAAAAATGGGCAATTGAGA 

TCTGATATCGTTGAATCTCTCAAACATCGTTTGAGC^ 

AGCTGTTGATAGAACTAGTAAAACAGTAACAGGTGCTTTTAATCTAA 

AAGTTGGAATTCCATTTTTTATTGATAACGATGCTAATGTTGC^ 

GTTGTTTTCGTAACCCTCGGAACAGGAGTAGGTGGAGGTGTTATCGC^ 

AATTGGGCATATGATTGTTGATCCAGAAAATGGATTTACGTGC^ 

GTGTTGTTAGAGTAGCACGTCAACTCGCAGAACAATATGAGG<j^ 

AGTAAAGATATTTTTATAGCAGCAGAAGATGGGGATAAATTTGCTAATTC 

AGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGT^ 

AGAAATACTTTGTC^CATTTGCTTTCCC^CMGTTAAAAAGTCMCTAAAATTAAGAT 



SEQ ID NO. 1402: SAG0471 FROM THE 090 GBS TYPE Ia STRAIN

CGTTTCTGATATCGTTGAATCTCTCAAACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGGTATC 

CAGGAGCTGTTGATAGAACTAGTAAAACAGTAAGAGGTGCTTT^ 

AAAGAAGTTGGAATTCCATTTTTTATTGATA^ 

CGATGTTGTTTTCGTAACCCTCGGAACAGGAGTAGGTGGAGGTGTTATCGCAGATGGTAACCTCATCCATGGTGTTGCA 
GAGAAATTGGX-JCATATGATTGTTGATCGAGAKAATGGATTTACGT^ 
ACAGGTGTTGTTAGAGTAGCACGTCAACTCGCAGAACAATATGAAGGTTC 
TACAAGTAAAGATATTTTTATAGCAGCAGAAGATGGGGATAAATTTGCTO 

CAGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATTGGTGGCGGTGTCTCAGCAG 
GTTGAGAAATACTTTGTCACATTTG 

SEQ ID NO. 1403: SAG0471 FROM THE COH1 GBS TYPE Ia STRAIN

ACAAGAAAAATGGGCAATTGAGACCAATACTTTAGAAAACGGAAGACATATCGTTTCTGATATCGTTGA 

GCCTCTATGGATTAACAAAAGATGACTTTCTCGGTATCGGTAT 

GCTTTTAATCTAAATTGGGCTGATACTCAAGA 

SEQ ID NO. 1404: SAG0471 FROM THE CJB110 GBS NONTYPEABLE STRAIN

TTGGTATCTTGACGCTTGAGGAGAAGTACAAGAAA 

TCGTTGAATCTCTCAAACATCGTTTGAGCCTCTATGGA^ 

GATAGAACTAGTAAAAC 

SEQ ID NO. 1405: SAG0471 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

CACCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATTGGTGGCGGTGTCTCAGCAGCA 
GTTGAGAAATACTTTGTCACATTTGCTTTCCCACAAGTTAAAAAGTCAACTA 

SEQ ID NO. 1406: SAG0471 FROM THE 2603V/R GBS TYPE V STRAIN 

GGGCAATTGAGACCAATACTTTAGAAAACGGAAGACATATCGTTTCTG^ 
TTAACAAAAGATGACTTTCTCGGTATCGGTATGGGTTCTCCAGGAGCTG 

SEQ ID NO. 1407: SAG0471 FROM THE H36b GBS TYPE Ib STRAIN

GGCAATTGAGACGAATACTTTAGAAAACGGAAG 

TAACAAAAGATGACTTTCTCGGTATCGGTATGGGTTCTCCAGGAGCTGTTGATAGAACTAGTAAAACAGTAACAGGTGCTTTTAATCTA 

AATTGGGCTGATACTCAAGAAGTAGGTTCAGTTATTGAAAAAGAAGTTGGAATTCCATTTTTTATTGATAACGATGCTAATGTT 

ACTTGGTGAACGCTGGGTAGGTGCTGGTGCCAATAATCCCGACGTTGTTTTCGTAACC 

SEQ ID NO. 1408: SAG0471 FROM THE H36b GBS TYPE Ib STRAIN (REVERSE COMPLEMENT)

GAGACAGTTGCATCAGCGACAGGTGTTGTTAGAGTAGC& 

TGACAACGGTGATACTGTTACAAGTAAAGATATTTTTATAGCAGCAGAAGATGGGGATAAATTTGCTAATTCTGT 

CACGTTACCTTGGACTGGCAGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATTGGTGGCGGTGTCTC^ 

GAATTTTTACGTAGTCGCGTTGAGAAATACTTTGTCACATTTGCTTTCCCACA 

SEQ ID NO. 1409: SAG0471 FROM THE M732 GBS TYPE III STRAIN 

ACAAGAAAAATGGGCAATTGAGACCATACTTAGAAAACGGAAGACATATCGTTTCTGATATCGTTGAA 
CTCTATGGATTAACAAAAGATGACTTTCTCGGTATCGGTATGGGTTCTCCA 

TTTTAATCTAAATTGGGCTGATACTCAAGAAGTAGGTTCGGTTATTGAAAAAGAAGTTGGAATTCCATTTTTTATTGATAACGATGCTA 

ATGTTGGAGCACTTGGTGAACGCTGGGTAGGTGCTGGTGCCAATAATCCCGATGTTGTTTTCGTAACCCTCGGAAC^ 

GGTGTTATCGCAGATGGTAACCTCIATCCATGGTGTTGCAAGAGCAGGTGGAGAAATTGGGCATATGATT 

SEQ ID NO. 1410: SAG0471 FROM THE M732 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

CAGCAGCAGGTGAATTTTTACGTAGTCGCGTTGAGAAATACTT^ 
ATTGCTGAACTAGGTAATGAT 

SEQ ID NO. 1411: SAG0471 FROM THE M781 GBS TYPE III STRAIN 

AGAAGTACAAGAAAATGGGCAATTGAGACCATACTTAGAAAACGGAAGACATATCGTTTCTGATATCGTTG 

TGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGGTATCGGTATGGGTTCTCCAGGAGCTGTTGATAGAACTAGTAAAACAGTAACA
GGTGCTTTTAATCTAAATTGGGCTGATACTCAAGAAGTAGGTTCGGTTATTGAAAAAGAAGTTGGAATTCCATTTTTTATTGATAACGA 
TGCTAATGTTGCAGCACTTGGTGAACGCTGGGTAGGTGCT 



SEQ ID NO. 1412: SAG0471 FROM THE M781 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

GATACTGTTACAAGTAAAGATATTTTTATAGCAGCAGAAGATGGGGATA^ 

TGGACTGGC^GCAGCTAATATTTCAAATATTTTAAACCCTGATTCTG 

GTAGTCGCGTTGAGAAATACTTTGTCACATTTGCTTTCCCACAAGTTAAAAA 

SEQ ID NO. 1413: SAG0471 FROM THE 090 GBS TYPE Ia STRAIN

AAATTTGGTATCTTGACGCTTGAGGGAGAAGTACAAGAAAAATGGGCAT^ 
TATCGTTGAATCTCTCAAACATCGTTTGAGCCTCTA 

TTGATAGAACTAGTAAAAC^GTAACAGGTGCTTTTAATCTAAATTGGGCTGATACT 

GGAATTCCATTTTTTATTGATAACGATGCTAATGTTGCAGC^CTTGGTGAACGCTGGGTAGGTGCTGGTGCa^TAATCCCGACGTTGT 
TTTCGTAACCCTCGGAACAGGAGTAGGTGGAGG

SEQ ID NO. 1414: SAG0471 FROM THE 090 GBS TYPE Ia STRAIN (REVERSE COMPLEMENT)

GTGATACTGTTACAAGTAAAGATATTTTTATAGCAGCAGAAGATGGGGATAAATTTGCTAATTCTGTTGTTGAACGTGTATCACGTTAC 

CTTGGACTGGCAGCAGCTAATATTTCAAATATTTTAAACCCTG^ 

ACGTAGTCGCGTTGAGAAATACTTTATCACATTTGCTTTCCCAC^^ 

SEQ ID NO. 1415: SAG0471 FROM THE JM9130013 GBS TYPE VIII STRAIN (REVERSE COMPLEMENT) 

GTTATCGCAGATGGTAACCTCATCCATGGTGTTGCAGGAGCAGGTGGA 
GTGCACATGTGGTAACAAAGGCTGCCTTGAGACAGT^ 

AGGGTTCGTCTGCCATTAAAGCAGCGATTGACCACGGTGATACTGTTACAAGTAAAGATATTTTTATAG 

TTTGCTAATTCTGTTGTTGAACGTGTATC^CGTTACCTTGGACTGGC^GCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGT 

TATTGGTGGCGGTGTCTCAGCAGCAGGTGAATTTTTACGTA 

AGTCAACTAA 

SEQ ID NO. 1416: SAG0471 FROM THE JM9130013 GBS TYPE VIII STRAIN (REVERSE COMPLEMENT) 

TGGTATCTTGACGCTTGAGGGAGAAGTAC^G 

GTTGAATCTCTCAAACATCGTTTGAGCCTCTATGGATTAACAAAAGATG^ 

TAGAACTAGTAAAACAGTGACAGGTGCTTTTAATCTAAATTGGGCTGATACTCAAGAAG 

TTCCATTTTTTATTG 

SEQ ID NO. 1417: SAG0471 FROM THE 2603V/R TYPE V GBS STRAIN (REVERSE COMPLEMENT) 

AGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATTGGT 
TTGAGAAATACTTTGTCACATTTGTTTTCCCACAAGGT

SEQ1401_ 

SEQ1402 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408 

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 TTATCGCAGATGGTAACCTCATCCATGGTGTTGCAGGAGCAGGTGGAGAAATTGGGCAT

SEQ1416 

SEQ1417 

SEQ1401_ 

SEQ1402 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408 GAG 

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414

SEQ1415 TGATTGTTGATCCAGAAAATGGATTTACGTGCACATGTGGTAACAAAGGCTGCCTTGAG 

SEQ1416 

SEQ1417 --- 



SEQ1401_ 

SEQ1402

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ1417 

SEQ1401_ 

SEQ1402 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407

SEQ1408

SEQ1409

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ1417 

SEQ1401_ 

SEQ1402 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408 

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ1417 



CAGTTGCATCAGCGACAGGTGTTGTTAGAGTAGC^CGTCAACTCGCAGAACAATATGAG 
C^GTTGCATC^GCGACAGGTGTTGTTAGAGTAGCACGTCAACTCGCAGAACAATATGAG 

GTTCGTCTGCCATTAAAGCAGCGATTGACAACGGTGATACTGTTACAAGTAAAGATATT 

GATACTGTTACAAGTAAAGATATT 

GTGATACTGTTACAAGTAAAGATATT 

GTTCGTCTGCCATTAAAGCAGCGATTGACCACGGTGATACTGTTACAAGTAAAGATATT 

TTAAATTTGGTATCTTGACGCTTGAGGGAGAAGTACAA 

ACAA 

TTGGTATCTTGACGCTTGAGG-AGAAGTACAA 

TTATAGCAGCAGAAGATGGGGATAAATTTGCTAATTCTGTTGTTGAACGTGTATCACGT 
ACAA 

AGAAGTACAA 

TTATAGCAGCAGAAGATGGGGATAAATTTGCTAATTCTGTTGTTGAACGTGTATCACGT 

AAATTTGGTATCTTGACGCTTGAGGGAGAAGTACAA 

TTATAGCAGCAGAAGATGGGGATAAATTTGCTAATTCTGTTGTTGAACGTGTATCACGT 
TTATAGCAGCAGAAGATGGGGATAAATTTGCTAATTCTGTTGTTGAACGTGTATCACGT 
TGGTATCTTGACGCTTGAGGGAGAAGTACAA 



SEQ1401 AAAAATGGGCAATTGAGACCAATAGTTTAGAAAACGGAAGACATATCGTTTCTGATATC

SEQ1402 CGTTTCTGATATC

SEQ1403 AAAAATGGGCAATTGAGACCAATACTTTAGAAAACGGAAGACATATCGTTTCTGATATC

SEQ1404 AAAAATGGGCAATTGAGACCAATACTTTAGAAAACGGAAGACATATCGTTTCTGATATC

SEQ1405 CACCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATT

SEQ1406 GGGCAATTGAGACCAATACTTTAGAAAACGGAAGACATATCGTTTCTGATATC

SEQ1407 GGCAATTGAGACCAATACTTTAGAAAACGGAAGACATATCGTTTCTGATATC

SEQ1408 ACCTTGGACTGGCAGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATT

SEQ1409 AAAAATGGGCAATTGAGACCA-TACTT-AGAAAACGGAAGACATATCGTTTCTGATATC

SEQ1410

SEQ1411 AAAA-TGGGCAATTGAGACCA-TACTT-AGAAAACGGAAGACATATCGTTTCTGATATC

SEQ1412 ACCTTGGACTGGCAGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATT

SEQ1413 AAAAATGGGCA-TTGAGACCA-TACTT-AGAAAACGGAAGACATATCGTTTCTGATATC

SEQ1414 ACCTTGGACTGGCAGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATT

SEQ1415 ACCTTGGACTGGCAGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATT

SEQ1416 AAAAATGGGCAATTGAGACCA-TACTT-AGAAAACGGAAGACATATCGTTTCTGATATC

SEQ1417 AGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATT

SEQ1401 TTGAATCTCTCA-AACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG

SEQ1402 TTGAATCTCTCA-AACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG

SEQ1403 TTGAATCTCTCA-AACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG

SEQ1404 TTGAATCTCTCA-AACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG

SEQ1405 GTGGCGGTGTCTCAGCAGCAGGTGAATTTTTACGTAGTCGCGTTGAGAAATACTTT

SEQ1406 TTGAATCTCTCA-AACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG

SEQ1407 TTGAATCTCTCA-AACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG

SEQ1408 GTGGCGGTGTCTCAGCAGCAGGTGAATTTTTACGTAGTCGCGTTGAGAAATACTTT

SEQ1409 TTGAATCTCTCA-AACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG

SEQ1410 CAGCAGCAGGTGAATTTTTACGTAGTCGCGTTGAGAAATACTTT

SEQ1411 TTGAATCTCTCA-AACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG

SEQ1412 GTGGCGGTGTCTCAGCAGCAGGTGAATTTTTACGTAGTCGCGTTGAGAAATACTTT

SEQ1413 TTGAATCTCTCA-AACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG

SEQ1414 GTGGCGGTGTCTCAGCAGCAGGTGAATTTTTACGTAGTCGCGTTGAGAAATACTTT

SEQ1415 GTGGCGGTGTCTCAGCAGCAGGTGAATTTTTACGTAGTCGCGTTGAGAAATACTTT

SEQ1416 TTGAATCTCTCA-AACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG

SEQ1417 GTGGCGGTGTCTCAGCAGCAGGTGAATTTTTACGTAGTCGCGTTGAGAAATACTTT

SEQ1401 ATCGGTATGGGTTCTCCAGGAGCTGTTGATAGAACTAGTAAAACAGTAACAGGTGCTTT

SEQ1402 ATCGGTATGGGTTCTCCAGGAGCTGTTGATAGAACTAGTAAAACAGTAACAGGTGCTTT

SEQ1403 ATCGGTATGGGTTCTCCAGGAGCTGTTGATAGAACTAGTAAAACAGTAACAGGTGCTTT

SEQ1404 ATCGGTATGGGGTCTCCAGGAGCTGTTGATAGAACTAGTAAAAC

SEQ1405 GTCACATTTGCTTTCCCACAAGTTAAAAAGTCAACTA

SEQ1406 ATCGGTATGGGTTCTCCAGGAGCTG

SEQ1407 ATCGGTATGGGTTCTCCAGGAGCTGTTGATAGAACTAGTAAAACAGTAACAGGTGCTTT

SEQ1408 GTCACATTTGCTTTCCCACA

SEQ1409 ATCGGTATGGGTTCTCCAGGAGCTGTTGATAGAACTAGTAAAACAGTAACAGGTGCTTT

SEQ1410 GTCACATTTGCTTTCCCACAAGTTAAAAAGTCAACTAAAATTAAGATTGCTGAACTAGG

SEQ1411 ATCGGTATGGGTTCTCCAGGAGCTGTTGATAGAACTAGTAAAACAGTAACAGGTGCTTT

SEQ1412 GTCACATTTGCTTTCCCACAAGTTAAAAA

SEQ1413 ATCGGTATGGGTTCTCCAGGAGCTGTTGATAGAACTAGTAAAACAGTAACAGGTGCTTT

SEQ1414 ATCACATTTGCTTTCCCACAAGTTAAAAAGTCAACTAAAATTAAGATTG

SEQ1415 GTCACATTTGCTTTCCCACAAGTTAAAAAGTCAACTAA

SEQ1416 ATCGGTATGGGTTCTCCAGGAGCTGTTGATAGAACTAGTAAAACAGTCACAGGTGCTTT

SEQ1417 GTCACATTTGTTTTCCCACAAGGT



SEQ1401_ 

SEQ1402 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408 

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ1417 

SEQ1401

SEQ1402 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408 

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ1417 

SEQ1401_ 

SEQ1402 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408 

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ1417 



AATCTAAATTGGGCTGATACTCAAGAAGTAGGTTCAGTTATTGAAAAAGAAGTTGGAAT 
AATCTAAATTGGGCTGATACTCAAGAAGTAGGTTCGGTTATTGAAAAAGAAGTTGGAAT 
AATCTAAATTGGGCTGATACTCAAGA 

AATCTAAATTGGGCTGATACTCAAGAAGTAGGTTCAGTTATTGAAAAAGAAGTTGGAAT 

AATCTAAATTGGGCTGATACTCAAGAAGTAGGTTCGGTTATTGAAAAAGAAGTTGGAAT 

AATGAT 

AATCTAAATTGGGCTGATACTCAAGAAGTAGGTTCGGTTATTGAAAAAGAAGTTGGAAT 

AATCTAAATTGGGCTGATACTCAAGAAGTAGGTTCAGTTATTGAAAAAGAAGTTGGAAT 

AATCTAAATTGGGCTGATACTCAAGAAGTAGGTTCAGTTATTGAAAAAGAAGCTGGAAT 

CCATTTTTTATTGATAACGATGCTAATGTTGCAGCACTTGGTGAACGCTGGGTAGGTGC 
CCATTTTTTATTGATAACGATGCTAATGTTGCAGCACTTGGTGAACGCTGGGTAGGTGC 

CCATTTTTTATTGATAACGATGCTAATGTTGCAGCACTTGGTGAACGCTGGGTAGGTGC 
CCATTTTTTATTGATAACGATGCTAATGTTGCAGCACTTGGTGAACGCTGGGTAGGTGC 
CCATTTTTTATTGATAACGATGCTAATGTTGCAGCACTTGGTGAACGCTGGGTAGGTGC 
CCATTTTTTATTGATAACGATGCTAATGTTGCAGCACTTGGTGAACGCTGGGTAGGTGC 

CCATTTTTTATTG 

GGTGCCAATAATCCCGACGTTGTTTTCGTAACCCTCGGAACAGGAGTAGGTGGAGGTGT 
GGTGCCAATAATCCCGATGTTGTTTTCGTAACCCTCGGAACAGGAGTAGGTGGAGGTGT 

GGTGCCAATAATCCCGACGTTGTTTTCGTAACC 

GGTGCCAATAATCCCGATGTTGTTTTCGTAACCCTCGGAACAGGAGTAGGTGGAGGTGT 

GGTGCCAATAATCCCGATGTTGTTTTCGTAACCCTCGGAACAGGAGTA 

GGTGCCAATAATCCCGACGTTGTTTTCGTAACCCTCGGAACAGGAGTAGGTGGAGG 
SEQ1401
SEQ1402 
SEQ1403 
SEQ1404 
SEQ1405 
SEQ1406 
SEQ1407 
SEQ1408 
SEQ1409 
SEQ1410 
SEQ1411 
SEQ1412 
SEQ1413 
SEQ1414 
SEQ1415 
SEQ1416 
SEQ1417 

SEQ1401 
SEQ1402
SEQ1403 
SEQ1404 
SEQ1405 
SEQ1406 
SEQ1407 
SEQ1408 
SEQ1409 
SEQ1410 
SEQ1411 
SEQ1412 
SEQ1413 
SEQ1414 
SEQ1415 
SEQ1416 
SEQ1417 

SEQ1401 
SEQ1402
SEQ1403 
SEQ1404 
SEQ1405 
SEQ1406 
SEQ1407 
SEQ1408 
SEQ1409
SEQ1410 
SEQ1411 
SEQ1412 
SEQ1413 
SEQ1414 
SEQ1415 
SEQ1416 
SEQ1417 




ATCGCAGATGGTAACCTCATCCATGGTGTTGCAGGAGCAGGTGGAGAAATTGGGCATAT
ATCGCAGATGGTAACCTCATCCATGGTGTTGCAGGAGCAGGTGGAGAAATTGGGCATAT



ATCGCAGATGGTAACCTCATCCATGGTGTTGCAAGAGCAGGTGGAGAAATTGGGCATAT



ATTGTTGATCCAGAAAATGGATTTACGTGCACATGTGGTAACAAAGGCTGCCTTGAGAC
ATTGTTGATCCL^GAKAATGGATTTACGTGCACATGTGGTAACAAAGGCTGTCTTGAGAC 



ATT 



GTTGCATCAGCGACAGGTGTTGTTAGAGTAGCACGTCAACTCGCAGAACAATATGAAGG






WO 2004/018646 



PCT/US2003/026827 






SEQ1401_ 

SEQ1402 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408 

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ1417 



TCGTCTGCCATTAAAGCAGCGATTGACACCGGTGATACTGTTACAAGTAAAGATATTTT 
TCGTCTGCCATTAAAGCAGCGATTGACAACGGTGATACTGTTACAAGTAAAGATATTTT



SEQ1401_ 

SEQ1402 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408 

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ1417 



ATAGCAGCAGAAGATGGGGATAAATTTGCTAATTCTGTTGTTGAACGTGTATCACGTTA 
ATAGCAGCAGAAGATGGGGATAAATTTGCTAATTCTGTTGTTGAACGTGTATCACGTTA 



SEQ1401_ 

SEQ1402

SEQ1403

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ1417 



CTTGGACTGGCAGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATTGG 
CTTGGACTGGCAGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATTGG 



SEQ1401_ 

SEQ1402

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408 

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ1417 

SEQ1401_ 

SEQ1402 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408 

SEQ1409 

SEQ1410 

SEQ1411

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415

SEQ1416 

SEQ1417 



GGCGGTGTCTCAGCAGCAGGTGAATTTTTACGTAGTCGCGTTGAGAAATACTTTGTCAC 
GGCGGTGTCTCAGCAGCAGGTGAATTTTTACGTAGTCGCGTTGAGAAATACTTTGTCAC 

TTTGCTTTCCCACAAGTTAAAAAGTCAACTAAAATTAAGAT 
TTTG
Table 15: Comparative Sequences relating to SAG0492



SEQ ID NO. 1501: SAG0492 FROM THE 1169NT1 GBS NONTYPEABLE STRAIN

TGACTTGGATATTCATCAAGGAGAAGTGGTGGTTATTATTG^ 
TGGAAGTACCAACAAAGGGAAC^GTGACTTTTGAAGGAATTGATAT^ 
GGCATGGTTTTTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAA^ 
TAAGCTTGATGCTCAGACAAAAGCIATACGAGCTACTTGA 

GAGGACAACAACAACGGATTGCTATTGCAAGAGGTCTTGCAATGAATCCTGATGTCC^ 

CCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAGCTAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTT 
TGCACGTGAAGTAGCGGATCGTGTCATTTTTATGGATGC^GGCATTATTGTGAGCAAGGGACCCCTAAGGAAGTAT 

SEQ ID NO. 1502: SAG0492 FROM THE 18RS21 GBS TYPE II STRAIN 

TTGGGAAAAATGAGGTTTTAAAAGGCATTGACTT 

TCAACATTTTTAAGAAC^TGAATCTCTTGGAAGTACCAAC^AAGGGA^ 

TGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTTTTCAACAGTTCAATCTATTTCCCAATAT 

TATCACCTATTAAGACAAAGGGGCTTTCTAATCTTGATGCTCAGACAAAAGCATATGAGCTACTTGAA 

GCTAATACTTATCCAGCTAGCTTATCTGGAGGACAACAACAACGAAT^ 

TTTTGATGAACCTACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAGCTAAATCTGGTATGACGA 
TGGTTATTGTCACTC^TGAAATGGGTTTTGCACGTGAAGTAGCGGATCGTGTCATTTTTATGGACGCAGAAATTAT 

SEQ ID NO. 1503: SAG0492 FROM THE 2603V/R GBS TYPE V STRAIN (REVERSE COMPLEMENT) 

AAAAATGAGGTTTTAAAAGGCATTGACTTGGATATTCATCAAGGAGA^ 
ATTTTTAAGAACAATGAATCTCTTGGAAGTACCAACAAAGGG 
TTTTTAAAATGCGCGAAAAAATGGGCATGGTTTTTCAACAGT^ 
CCTATTAAGACAAAGGGGCTTTCTAATCTTGATGCTCAGACA^ 

TACTTATCC^GCTAGCTTATCTGGAGGAC^CAACAACGAATTGCTATTGCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTG 
ATGAACCTACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAGCTAAATCTGGTATGACGATGGTT 
ATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAGCGGATCGTGTC^ 

SEQ ID NO. 1504: SAG0492 FROM THE M781 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

GAGGTTTTAAAAGGCATTGACTTGGATATTCATCAAGGAGAAGTGGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACATTTTT 

AAGAACAATGAATCTCTTGGAAGTACCAACAAAGG 

AAATGCGCGAAAAAATGGGCATGGTTTTTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAAAATATTA 

AAGACAAAGGGACTTTCTAAGCTTGATGCTCAGACAAAAGCATACGAGCTACTTGAAAAAGT^ 

TCCAGCAAGCTTATCTGGAGGACAACAACAACGGATTGCTATTGCAAGAGG 

CTACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAGCTAAATCTGGTATGACGATGGTTATTGTC 
ACTCATGAAATGGGTTTTGCACGTGAAGTAGCGGATCGTGTC^TTTTTATGGATGaAGGGATTATTGTTGAGC^GGGACCCCTAAGAA 

AGTAT 

SEQ ID NO. 1505: SAG0492 FROM THE 090 GBS TYPE Ia STRAIN

TGGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAA 

ACTTTTGAAGGGATTGATATAAC^GAC^AAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGC^ 

ATTTCCCAATATGACTGTACTAGAAAATATTACTTTATCACCTATTAAGACAAAGGGACTTTCTAAGCTTGAT 

ACGAGCTACTTGAAAAAGTTGGACTCAAAGAGAAGGCTAATGCTTATCC^ 

GCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGA 

TGTTATGCAAGATTTAGCTAAATCTGGTATGACGATGGTTATTGTCACTGATG^ 

TTTTTATGGATGCAGGCATTATTGTTgAsCAAGGGACCCCTAAGGAAGTA 

SEQ ID NO. 1506: SAG0492 FROM THE A909 GBS TYPE Ia STRAIN

CAATACAAGGACTTCATAAAAGTTTTGGGAAAAATGAGGTTTTAA 

ATTGGCCCTTCTGGCTCTGGTAAGTCAACATTTTTAAGAACAATGAATCT^^ 

GATTGATATAACAGACAAAAAGAATGATATTTTTAAAATG 

TGACTGTACTAGAAAATATTACTTTATCACCTATTAAGACAAAGGGGCTTTCTAAGCTrGATGCTCAGACAAAAGCATATGAGCTACTT 
GAAAAAGTTGGACTCAAAGAGAAGGCTAATACTTATC(^GCTAGCTTATCTGGAGGACAACAACAA 

TGCAATGAATCCTGATGTCCTTCTTTTTGATGAACCTACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCMG 

ATTTAGCTAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAGCGGATCGTGTCATTTTTAT 

GCAGGAATTATTGTgAGCAAGGGGCCCCTAAGGAAGTATTTGAGCAGACAA 

SEQ ID NO. 1507: SAG0492 FROM THE CJB110 GBS NONTYPEABLE STRAIN (REVERSE COMPLEMENT) 

GACTTGGATATTCATCAAGGAGAAGTGGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACATTTTTAAGAACAATGAATCTCTT 
GGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGATTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGG
GCATGGTTTTTCAACAGTTCAATCTATTTCCC^ 

AAGCTTGATGCTCAGACAAAAGCATACGAGCTACTTGAAAAAGTTGGA 

AGGACAACAACAACGAATTGCTATTGCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGAT 

CTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAGCTAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTT 
GCACGTGAAGTAGCGGATCGTGTCTTTTTATGGATGCGGGAATTATTGTGAGCAAGGGACC 



SEQ ID NO. 1508: SAG0492 FROM THE H36b GBS TYPE Ib STRAIN

ATGAGGTTTTAAAAGGCATTGACTTGGATATTCATCAAGGAGAAGTAGTGGTTATT^ 
TTAAGAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAG 

TAAAATGCGCGAAAAAATGGGCATGGTTTTTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAAA 

TTAAGACAAAGGGGCTTTCTAAGCTTGATGCTCAGACAAAAGCATATGAGCTACTTGAAAAAGTTGGACTCAAAGAG 

TATCC^GCTAGCTTATCTGGAGGACAACAACAACGAATTGCTATTGCAAGAGGTCTTGC^ 

ACCTACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAGCTAAATCTGGTATGACGATGGTTATTG 

TCACTCATGAAATGGGTTTTGCACGTGAAGTAGCGGATCGTGTCATTTTTATGGATGC^ 

GAAGTAT 

SEQ ID NO. 1509: SAG0492 FROM THE JM9130013 GBS TYPE VIII STRAIN (REVERSE COMPLEMENT) 

GGTTTTAAAAGGCATTGACTTGGATATTCATCAAGGAG 
GAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAGTG 

ATGCGCGAAAAAATGGGCATGGTTTTTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAAA^ 

GACAAAGGGGCTTTCTAAGCTTGATGCTCAGACAAAAGCATATGAGCTACTTGAAAAAGTTGGA 

CAGCTAGCTTATCTGGAGGACAAGAACAACGAATTGCTATTGCAAGA(^ 

ACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATT^ 

TCATGAAATGGGTTTTG(^CGTGAAGTAGCGGATCGTGTCATTTT^ 

TATTTAGCAAAACAAAAGAAAT 

SEQ ID NO. 1510: SAG0492 FROM THE M732 GBS TYPE III STRAIN 

GGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTG 

CTTTTGAAGGGATTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTTTTCAACAGTTCAATCTA 

TTTCCCAATATGACTGTACTAGAAAATATTACTTTATCACCTATTAAGACAAAGGGACTTTCTAAGCTTGATGCTCAGA 

CGAGCTACTTGAAAAAGTTGGACTCAAAGAGAAGGCTAATGCTTATCCAGCAAGCTTATCTGG 

SEQ ID NO. 1511: SAG0492 FROM THE COH1 GBS TYPE Ia STRAIN

ATTGACTTGGATATTCATCAAGGAGAAGTGGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACATTTTTAAGAACAATGAATCT 

CTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGATTGATATAACA 

TGGGCATGGTTTTTCAACAGTTCAATCTATTTCCCAATATGA^ 

TCTAAGCTTGATGCTCAGACAAAAGCATACGAGCTACTTGAAAAAGTTGGACTCAAAGAGAAGGCTAATGCTTATCCAG™ 
TGG 

SEQ1501 TGACTTGG 

SEQ1502 TTGGGAAAAATGAGGTTTTAAAAGGCATTGACTTGG 

SEQ1503 AAAAATGAGGTTTTAAAAGGCATTGACTTGG

SEQ1504 GAGGTTTTAAAAGGCATTGACTTGG 

SEQ1505 

SEQ1506 AATACAAGGACTTCATAAAAGTTTTGGGAAAAATGAGGTTTTAAAAGGCATTGACTTGG 

SEQ1507 GACTTGG 

SEQ1508 ATGAGGTTTTAAAAGGCATTGACTTGG

SEQ1509 GGTTTTAAAAGGCATTGACTTGG

SEQ1510 

SEQ1511 -ATTGACTTGG 

SEQ1501 TATTCATCAAGGAGAAGTGGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACAT 

SEQ1502 TATTCATCAAGGAGAAGTAGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACAT

SEQ1503 TATTCATCAAGGAGAAGTAGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACAT

SEQ1504 TATTCATCAAGGAGAAGTGGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACAT

SEQ1505 TGGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACAT

SEQ1506 TATTCATCAAGGAGAAGTAGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACAT

SEQ1507 TATTCATCAAGGAGAAGTGGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACAT

SEQ1508 TATTCATCAAGGAGAAGTAGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACAT

SEQ1509 TATTCATCAAGGAGAAGTAGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACAT

SEQ1510 GGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACAT

SEQ1511 TATTCATCAAGGAGAAGTGGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACAT

SEQ1501 TTTTAAGAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGAA

SEQ1502 TTTTAAGAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGA

SEQ1503 TTTTAAGAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGA

SEQ1504 TTTTAAGAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGA

SEQ1505 TTTTAAGAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGA

SEQ1506 TTTTAAGAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGA

SEQ1507 TTTTAAGAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGA

SEQ1508 TTTTAAGAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGA

SEQ1509 TTTTAAGAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGA

SEQ1510 TTTTAAGAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGA

SEQ1511 TTTTAAGAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGA 

SEQ1501 TTGATATAACAGACAAAAAAAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT

SEQ1502 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT

SEQ1503 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT

SEQ1504 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT

SEQ1505 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT

SEQ1506 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT

SEQ1507 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT

SEQ1508 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT

SEQ1509 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT

SEQ1510 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT

SEQ1511 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT 

SEQ1501 TTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAAAATATTACTTTATCACCTA

SEQ1502 TTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAAAATATTACTTTATCACCTA

SEQ1503 TTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAAAATATTACTTTATCACCTA

SEQ1504 TTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAAAATATTACTTTATCACCTA

SEQ1505 TTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAAAATATTACTTTATCACCTA

SEQ1506 TTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAAAATATTACTTTATCACCTA

SEQ1507 TTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAAAATATTACTTTATCACCTA

SEQ1508 TTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAAAATATTACTTTATCACCTA

SEQ1509 TTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAAAATATTACTTTATCACCTA

SEQ1510 TTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAAAATATTACTTTATCACCTA

SEQ1511 TTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAAAATATTACTTTATCACCTA



SEQ1501 
SEQ1502 
SEQ1503 
SEQ1504 
SEQ1505 
SEQ1506 
SEQ1507 
SEQ1508 
SEQ1509 
SEQ1510 
SEQ1511 



TTAAGACAAAGGGACTTTCTAAGCTTGATGCTCAGACAAAAGCATACGAGCTACTTGAAA 
TTAAGACAAAGGGGCTTTCTAATCTTGATGCTCAGACAAAAGCATATGAGCTACTTGAAA 
TTAAGACAAAGGGGCTTTCTAATCTTGATGCTCAGACAAAAGCATATGAGCTACTTGAAA 
TTAAGACAAAGGGACTTTCTAAGCTTGATGCTCAGACAAAAGCATACGAGCTACTTGAAA 
TTAAGACAAAGGGACTTTCTAAGCTTGATGCTCAGACAAAAGCATACGAGCTACTTGAAA 
TTAAGACAAAGGGGCTTTCTAAGCTTGATGCTCAGACAAAAGCATATGAGCTACTTGAAA 
TTAAGACAAAGGGACTTTCTAAGCTTGATGCTCAGACAAAAGCATACGAGCTACTTGAAA
TTAAGACAAAGGGGCTTTCTAAGCTTGATGCTCAGACAAAAGCATATGAGCTACTTGAAA 
TTAAGACAAAGGGGCTTTCTAAGCTTGATGCTCAGACAAAAGCATATGAGCTACTTGAAA 
TTAAGACAAAGGGACTTTCTAAGCTTGATGCTCAGACAAAAGCATACGAGCTACTTGAAA 
TTAAGACAAAGGGACTTTCTAAGCTTGATGCTCAGACAAAAGCATACGAGCTACTTGAAA 



SEQ1501 
SEQ1502
SEQ1503 
SEQ1504 
SEQ1505 
SEQ1506 
SEQ1507 
SEQ1508 
SEQ1509 
SEQ1510 
SEQ1511 



AAGTTGGACTCAAAGAGAAGGCTAATGCTTATCCAGCTAGCTTATCTGGAGGACAACAAC 
AAGTTGGACTCAAAGAGAAGGCTAATACTTATCCAGCTAGCTTATCTGGAGGACAACAAC
AAGTTGGACTCAAAGAGAAGGCTAATACTTATCCAGCTAGCTTATCTGGAGGACAACAAC
AAGTTGGACTCAAAGAGAAGGCTAATGCTTATCCAGCAAGCTTATCTGGAGGACAACAAC
AAGTTGGACTCAAAGAGAAGGCTAATGCTTATCCAGCTAGCTTATCTGGAGGGCAACAAC
AAGTTGGACTCAAAGAGAAGGCTAATACTTATCCAGCTAGCTTATCTGGAGGACAACAAC 
AAGTTGGACTCAAAGAGAAGGCTAATGCTTATCCAGCTAGCTTATCTGGAGGACAACAAC 
AAGTTGGACTCAAAGAGAAGGCTAATACTTATCCAGCTAGCTTATCTGGAGGACAACAAC
AAGTTGGACTCAAAGAGAAGGCTAATACTTATCCAGCTAGCTTATCTGGAGGACAACAAC 

AAGTTGGACTCAAAGAGAAGGCTAATGCTTATCCAGCAAGCTTATCTGG 

AAGTTGGACTCAAAGAGAAGGCTAATGCTTATCCAGCAAGCTTATCTGG



SEQ1501 
SEQ1502 
SEQ1503 
SEQ1504 
SEQ1505 
SEQ1506 
SEQ1507 
SEQ1508 
SEQ1509 
SEQ1510 
SEQ1511



ACGGATTGCTATTGCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAAC 
ACGAATTGCTATTGCAAGAGGTCTTGCAATGAATCCTCATGTCCTTCTTTTTGATGAAC 
ACGAATTGCTATTGCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAAC 
ACGGATTGCTATTGCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAAC 
ACGAATTGCTATTGCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAAC 
ACGAATTGCTATTGCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAAC 
ACGAATTGCTATTGCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAAC 
ACGAATTGCTATTGCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAAC 
ACGAATTGCTATTGCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAAC 






SEQ1501 
SEQ1502 
SEQ1503 
SEQ1504 
SEQ1505 
SEQ1506 
SEQ1507 
SEQ1508 
SEQ1509 
SEQ1510 
SEQ1511



TACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAG 
TACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAG
TACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAG
TACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAG 
TACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAG
TACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAG 
TACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAG 
TACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAG
TACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAG 



SEQ1501 
SEQ1502 
SEQ1503 
SEQ1504 
SEQ1505 
SEQ1506 
SEQ1507 
SEQ1508 
SEQ1509 
SEQ1510 
SEQ1511 



TAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAG 
TAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAG 
TAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAG 
TAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAG 
TAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAG 
TAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAG 
TAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAG 
TAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAG 
TAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAG 



SEQ1501 
SEQ1502 
SEQ1503 
SEQ1504 
SEQ1505 
SEQ1506 
SEQ1507 
SEQ1508
SEQ1509 
SEQ1510 
SEQ1511 



GGATCGTGTCATTTTTATGGATGCAGGCATTATTGT-GAGCAAGGGACCCCTAAGGAAG

GGATCGTGTCATTTTTATGGACGCAGAAATTAT

GGATCGTGTCATTTTTATGGATGCAGGAATTATTGTTGAGCAAGGGGCCC 

GGATCGTGTCATTTTTATGGATGCAGGGATTATTGTTGAGCAAGGGACCCCTAAGAAAG 
GGATCGTGTCATTTTTATGGATGCAGGCATTATTGTTGASCAAGGGACCCCTAAGGAAG 
GGATCGTGTCATTTTTATGGATGCAGGAATTATTGTGAGCAAGGGGCCCCTAAGGAAGT 

GGATCGTGTC-TTTTTATGGATGCGGGAATTATTGT-GAGCAAGGGACC

GGATCGTGTCATTTTTATGGATGCASGAATTATTGTTGAGCAAGGGGCCCCTAAGGAAG
GGATCGTGTCATTTTTATGGATGCAGGAATTATTGTTGAGCAAGGGGCCCCTAAGGAAG



SEQ1501 
SEQ1502
SEQ1503
SEQ1504
SEQ1505
SEQ1506 
SEQ1507 
SEQ1508 
SEQ1509 
SEQ1510 
SEQ1511 



AT- 



AT- 
A-- 



TTTGAGCAGACAAAAGAAATCCGCACAAGAGATTTCTT 



AT- 



ATTTAGCAAAACAAAAGAAAT - 
Table 16: Comparative Sequences relating to SAG0767 (D-alanine - D-alanine ligase)



SEQ ID NO. 1601: SAG0767 FROM THE M781 GBS TYPE III STRAIN 

TGGTCGCTCTGTCGGAACGTGAAGTATCTGTACTGTCTGCAGAAAGCGTCATGCGTGCTATTAATTATGATAAATTTTTT 
GTTAAAACTTATTTTATCACGCAAGTAGGTCAATTTATTAAAACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAA 
GTTAATGACAAACCAAACTGTTGATTTAGACAAAATGGTTCGTCCAAGTGATATCTATGATGATAATGCAATTGTTTTCC 
CCGTTTTACATGGACCAATGGGGGAAGATGGTTCTATCCAAGGATTTTTAGAAGTTTTAAGGATGCCTTATGTTGGGACT 
AATATTCTATCTTCAAGCGTGGCTATGGATAAAATTACAACAAAACAAGTCCTTGCAACAGTAGGTGTACCTCAGGTTGC 
ATATCAAACTTATTTTGAGGGTGATGATTTGGAACATGCGATTAAACTCTCTTTAGAAACTTTAAGTTTCCCAATTTTTG 
TAAAACCGGCTAATATGGGGTCATCAGTAGGTATTTCAAAAGCGACAGATGAATCCTCACTTCGCTCTGCAATTGACTTA
GCTCTCAAGTATGATAGCCGTATTTTGATTGAACAAGGCGTGACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAA 
TGATGTTAAGACAACTTTTCCTGGCGAAGTTGTTAAAGACGTCGATTTCTATGACTATGACGCCAAATATATTGATAATA 
AAATTACTATGGATATTCCAGCTAAAGTTGATGAAGCAACTATGGAAGCAATGCGTCAATATGCAAGTAAAGCTTTTAAA
GCAATCGGGGCTTGTGGTTTATCACGCTGTGATTTCTTTTTGACGAAAGATGGACAAATCTTCTTAAACGAACTGAATAC 
AATGCCCGGTTTTACTCAGTGGTCAATGTATCCTCTGCTTTGGGAAAATATGGGGCTAACTTATAGTGATTTGATTG 

SEQ ID NO. 1602: SAG0767 FROM THE 090 GBS TYPE Ia STRAIN

AAACCGGGCATTGTATTCAGTTCGTTTAAGAAGACTTGTCCATCTTTCGTCAAAAAGAAATCACAGCGTGATAAACCACA 
AGCCCCGATTGCTTTAAAAGCTTTACTTGCATATTGACGCATTGCTTCCATAGTTGCTTCATCAACTTTAGCTGGAATAT 
CCATAGTAATTTTATTATCAATATATTTGGCGTCATAGTCATAGAAATCGACGTCTTTAACGACTTCGCCAGGAAAAGTT
GTCTTAACATCATTATTGCCTAAAATACCTACTTCAATTTCACGAGCTGTCACGCCTTGTTCAATCAAAATACGGCTATC 
ATACTTGAGAGCTAAGTCAATksCAGAGCGAAGTGAGGATTCATCTGTCGCTTTTGAAATACCTACTGATGACCCCATAT 
TAGCCGGTTTTACAAAAATTGGGAAACTTAAAGTTTCTAAAGAGAGTTTAATCGCATGTTCCAAATCATCACCCTCAAAA 
TAAGTTTGATATGCAACCTGAGGTACACCTACTGTTGCAAGGACTTGTTTTGTTGTAATTTTATCCATAGCCACGCTTGA 
AGATAGAATATTAGTCCCAACATAAGGCATCCTTAAAACTTCTAAAAATCCTTGGATAGAACCATCTTCCCCCATTGGTC
CATGTAAAACGGGGAAAACAATTGCATTATCATCATAGATATCACTTGGACGAACCATTTTGTCTAAATCAACAGTTTGG 
TTTGTCATTAACTTTTCATCTGAAGATGGCATTTCATCAAATTCTTGTGTTTTAATAAATTGACCTACTTGCGTG 

SEQ ID NO. 1603: SAG0767 FROM THE COH1 TYPE Ia STRAIN

TCGCTCTGCGGAACGTGAAGTATCTGTACTGTCTGCAGAAAGCGTCATGCGTGCTATTAATTATGATAAATTTTTTGTTA 
AAACTTATTTTATCACGCAAGTAGGTCAATTTATTAAAACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTA
ATGACAAACCAAACTGTTGATTTAGACAAAATGGTTCGTCCAAGTGATATCTATGATGATAATGCAATTGTTTTCCCCGT
TTTACATGGACCAATGGGGGAAGATGGTTCTATCCAAGGATTTTTAGAAGTTTTAAGGATGCCTTATGTTGGGACTAATA
TTCTATCTTCAAGCGTGGCTAT 

SEQ ID NO. 1604: SAG0767 FROM THE CJB110 GBS NONTYPEABLE STRAIN (REVERSE 
COMPLEMENT) 

CGTCGATTTCTATGACTATGACGCCAAATATATTGATAATAAAATTACTATGGATATTCCAGCTAAAGTTGATGAAGCAA 
CTATGGAAGCAATGCGTCAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTTCTTT 
TTGACGAAAGATGGACAAATCTTCTTAAACGAACTGAATACAATGCCC

SEQ ID NO. 1605: SAG0767 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

AACGTGAAGTATCTGTACTGCTCTGCAGAAAAGCGTCATGCGTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATT 
TTATCACGCAAGTAGGTCAATTTATTAAAACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAA 

SEQ ID NO. 1606: SAG0767 FROM THE 1169NT1 GBS TYPE V STRAIN (REVERSE COMPLEMENT) 

CTAATATGGGGTCATCAGTAGGTATTTCAAAAGCGACAGATGAATCCTCACTTCGCTCTGCAATTGACTTAGCTCTCAAG 
TATGATAGCCGTATTTTGATTGAACAAGGCGTGACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAA 
GACAACTTTTCCTGGCGAAGTCGTTAAAGACGTCGATTTCTATGACTATGACGCCAAATATATTGATAATAAAATTACTA
TGGATATTCCAGCTAAAGTTGATGAAGCAACTATGGAAGCAATGCGTCAATATGCAAGTAAAGCTTTTAAAGCAATCGGG 
GCTTGTGGTTTATCACGCTGTGATTTCTTTTTGACGAAAGATGGACAAATCTTCTTAAACGAACTGAATACAATGCCCGG 
TTTTACTCAGTGGTCAATGTATCCTCTGCTTTGGGAAAAT 

SEQ ID NO. 1607: SAG0767 FROM THE 18RS21 GBS TYPE II STRAIN (REVERSE COMPLEMENT) 

TTGACTTAGCTCTCAAGTATGATAGCCGTATTTTGATTGAACAAGGCGTGACAGCTCGTGAAATTGAAGTAGGTATTTTA 
GGCAATAATGATGTTAAGACAACTTTTCCTGGCGAAGTCGTTAAAGACGTCGATTTCTATGACTATGACGCCAAATATAT
TGATAATAAAATTACTATGGATATTCCAGCTAAAGTTGATGAAGCAACTATGGAAGCAATGCGTCAATATGCAAGTAAAG
CTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTTCTTTTTGACGAAAGATGGACAAATCTTCTTAAACGAA 
CTGAATACAATGCCCGGTTTTACTCAGTGGTCAATGTATCCCCTGCTTTGGGAAAAGTATGGGGCTAACCTT 

SEQ ID NO. 1608: SAG0767 FROM THE 18RS21 GBS TYPE II STRAIN 

ATCTGTACTGTCTGCAGAAAAGCGTCATGCGTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATTTTATCACGCAA 
GTAGGTCAATTTATTAAAACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTAATGACAAACCAAACTGTTGA 
TTTAGACAAAATGGTTCGTCCAAGTGATATCTATGATGATAATGCAATTGTTTTCCCCGTTTTACATGGACCAATGGGGG 
AAGATGGTTCTATCCAAGGATTTTTAGAAGTTTTAAGGATGCCTTATGTTGGGACTAATATTCTATCTTCAA 



SEQ ID NO. 1609: SAG0767 FROM THE 2603V/R GBS TYPE V STRAIN (REVERSE COMPLEMENT) 

GGCTATGGATAAAATTACAACAAAACAAGTCCTTGCAACAGTAGGTGTACCTCAGGTTGCATATCAAACTTATTTTGAGG 
GTGATGATTTGGAACATGCGATTAAACTCTCTTTAGAAACTTTAAGTTTCCCAATTTTTGTAAAACCGGCTAATATGGGG 
TCATCAGTAGGTATTTCAAAAGCGACAGATGAATCCTCACTTCGCTCTGCAATTGACTTAGCTCTCAAGTATGATAGCCG
TATTTTGATTGAACAAGGCGTGACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAAGACAACTTTTC 
CTGGCGAAGTCGTTAAAGACGTCGATTTCTATGACTATGACGCCAAATATATTGATAATAAAATTACTATGGATATTCCA 
GCTAAAGTTGATGAAGCAACTATGGAAGCAATGCGTCAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTT 
ATCACGCTGTGATTTCTTTTTGACGAAAGAATGGACAAATCTTCTTAAACGAACTGAAATAC 

SEQ ID NO. 1610: SAG0767 FROM THE 2603V/R GBS TYPE V STRAIN

TCTGTACTGTCTGCAGAAAGCGTCATGCGTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATTTTATCACGCAAGT 
AGGTCAATTTATTAAAACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTAATGACAAACCAAACTGTTGATT 
TAGACAAAATGGTTCGTCCAAGTGATATCTATGATGATAAT 

SEQ ID NO. 1611: SAG0767 FROM THE H36b GBS TYPE Ib STRAIN (REVERSE COMPLEMENT)

AAAACCGGCTAATATGGGGTCATCAGTAGGTATTTCAAAAGCGACAGATGAATCCTCACTTCGCTCTGCAATTGACTTAG 
CTCTCAAGTATGATAGCCGTATTTTGATTGAACAAGGCGTGACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAAT 
GATGTTAAGACAACTTTTCCTGGCGAAGTCGTTAAAGACGTCGATTTCTATGACTATGACGCCAAATATATTGATAATAA
AATTACTATGGATATTCCAGCTAAAGTTGATGAAGCAACTATGGAAGCAATGCGTCAATATGCAAGTAAAGCTTTTAAAG 
CAATCGGGGCTTGTGGTTTATCACGCTGTGATTTCTTTTTGACGAAAGATGGACAAATCTTCTTAAACGAACTGAATACA 
ATGCCCGGTTTTACTCAGTGGTCAATGTATCCCCTGCTTTGGGAAAATATGGGGCTAACTTATAG 

SEQ ID NO. 1612: SAG0767 FROM THE H36b TYPE Ib STRAIN

CGTGAAGTATCTGTACTGTCTGCAGAAAGCGTCATGCGTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATTTTAT 
CACGCAAGTAGGTCAATTTATTAAAACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTAATGACAAACCAAA 
CTGTTGATTTAGACAAAATGGTTCGTCCAAGTGATATCTATGATGATAATGCAATTGTTTTCCCCGTTTTACATGGACCA 
ATGGGGGAAGATGGTTCTATCCAAGGATTTTTAGAAGTTTTAAGGATGCCTTATGTTGGGACTAATATTCTATCTTCAAG 
CGTGGCTATGGATAAAATTACAACAAAACAAGTCCTTGCAACAGTAG 

SEQ ID NO. 1613: SAG0767 FROM THE M732 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

ATGCGATTAAACTCTCTTTAGAACCTTTAAGTTTCCCAATTTTTGTAAACCCGGCTAATATGGGGTCATCAGTAGGTATT 
TCAAAAGCGACAGATGAATCCTCACTTCGCTCTGCAATTGACTTAGCTCTCAAGTATGATAGCCGTATTTTGATTGAACA 
AGGCGTGACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAAGACAACTTTTCCTGGCGAAGTTGTTA 
AAGACGTCGATTTCTATGACTATGACGCCAAATATATTGATAATAAAATTACTATGGATATTCCAGCTAAAGTTGATGAA 
GCAACTATGGAAGCAATGCGTCAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTT 
CTTTTTGACGAAAGATGGACAAATCTTCTTAAACGAACTGAATACAATGCCCGGTTTTACTCAGTGGTCAATGTATCCTC 
TGCTTTGGGAAAATATGGGGCTAACTT 

SEQ ID NO. 1614: SAG0767 FROM THE M732 GBS TYPE III STRAIN 

GTCATGCCGTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATTTTATCACGCAAGTAGGTCAATTTATTAAAACAC
AAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTAATGACAAACCAAACTGTTGATTTAGACAAAATGGTTCGTCCA 
AGTGATATCTATGATGATAATGCAATTGTTTTCCCCGTTTTACATGGACCAATGGGGGAAGATGGTTCTATCCAAGGATT 
TTTAGAAGTTTTAAGGATGCCTTATGTTGGGACTAATATTCTATCTTCAAGCGTGGCTATGGATAAAATTACAACAAAAC
AAGTCCTTGCAACAGTAGGTGTACCTCAGG 

SEQ ID NO. 1615: SAG0767 FROM THE A909 GBS TYPE Ia STRAIN (REVERSE COMPLEMENT)

TTTTGAGGGTGATGATTTGGAACATGCGATTAAACTCTCTTTAGAAACTTTAAGTTTCCCAATTTTTGTAAAACCGGCTA 
ATATGGGGTCATCAGTAGGTATTTCAAAAGCGACAGATGAATCCTCACTTCGCTCTGCAATTGACTTAGCTCTCAAGTAT 
GATAGCCGTATTTTGATTGAACAAGGCGTGACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAAGAC 
AACTTTTCCTGGCGAAGTCGTTAAAGACGTCGATTTCTATGACTATGACGCCAAATATATTGATAATAAAATTACTATGG
ATATTCCAGCTAAAGTTGATGAAGCAACTATGGAAGCAATGCGTCAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCT
TGTGGTTTATCACGCTGTGATTTCTTTTTGACGAAAGATGGACAAATCTTCTTAAACGAACTGAATACAATGCCCGGTTT 
TACTCAGTGGTCAATGTATCCCCTGCTTTGGGAAAATATGGGGCTAACTTATAGTGA 

SEQ ID NO. 1616: SAG0767 FROM THE A909 GBS TYPE Ia STRAIN

TGGTCGCTCTGCGGAACGTGAAGTATCTGTACTGTCTGCAGAAAGCGTCATGCGTGCTATTAATTATGATAAATTTTTTG 
TTAAAACTTATTTTATCACGCAAGTAGGTCAATTTATTAAAACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAG 
TTAATGACAAACCAAACTGTTGATTTAGACAAAATGGTTCGTCCAAGTGATATCTATGATGATAATGCAATTGTTTTCCC
CGTTTTACATGGACCAATGGGGGAAGATGGTTCTATCCAAGGATTTTTAGAAGTTTTAAGGATGCCTTATGTTGGGACTA 
ATATTCTATCTTCAAGCGTGGCTATGGATAAAATTACAACAAAACAAGTCCTTGCAACAGTAGG



SEQ ID NO. 1617: SAG0767 FROM THE JM9130013 GBS TYPE VIII STRAIN (REVERSE
COMPLEMENT) 

AAGCAGGGGATACATTGACCACTGAGTAAAACCGGGCATTGTATTCAGTTCGTTTAAGAAGATCTGTCCATCTTTCGTCA 
AAAAGAAATCACAGCGTGATAAACCACAAGCCCCGATTGCTTTAAAAGCTTTACTTGCATATTGACGCATTGCTTCCATA 
GATGCTTCATCAACTTTAGCTGGAATATCCATAGCAATTTTATTATCAATATATTTGGCG



SEQ1601 GGTCGCTCTGTCGGAACGTGAAGTATCTGTACTGTCTGCAGAAAGCGTCATGCGTGCTA 

SEQ1602 

SEQ1603 

SEQ1604

SEQ1605 

SEQ1606 

SEQ1607 

SEQ1608 

SEQ1609 

SEQ1610 

SEQ1611 

SEQ1612 

SEQ1613 

SEQ1614 

SEQ1615 

SEQ1616 

SEQ1617 



SEQ1601 TAATTATGATAAATTTTTTGTTAAAACTTATTTTATCACGCAAGTAGGTCAATTTATTA

SEQ1602 

SEQ1603 

SEQ1604 

SEQ1605 

SEQ1606 

SEQ1607 

SEQ1608 

SEQ1609 

SEQ1610 

SEQ1611 

SEQ1612 

SEQ1613 

SEQ1614 

SEQ1615 

SEQ1616 

SEQ1617



SEQ1601 AACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTAATGACAAACCAAACTG

SEQ1602 

SEQ1603 

SEQ1604 

SEQ1605 

SEQ1606 

SEQ1607 

SEQ1608 

SEQ1609 

SEQ1610 

SEQ1611 

SEQ1612 

SEQ1613 

SEQ1614 

SEQ1615 

SEQ1616 

SEQ1617 



SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



TGATTTAGACAAAATGGTTCGTCCAAGTGATATCTATGATGATAATGCAATTGTTTTCC 



SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



CGTTTTACATGGACCAATGGGGGAAGATGGTTCTATCCAAGGATTTTTAGAAGTTTTAA 



SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



GATGCCTTATGTTGGGACTAATATTCTATCTTCAAGCGTGGCTATGGATAAAATTACAA 



-GGCTATGGATAAAATTACAA 



SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 

SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 

SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



AAAACAAGTCCTTGCAACAGTAGGTGTACCTCAGGTTGCATATCAAACTTATTTTGAGG

AAAACAAGTCCTTGCAACAGTAGGTGTACCTCAGGTTGCATATCAAACTTATTTTGAGG

TTTTGAGG

TGATGATTTGGAACATGCGATTAAACTCTCTTTAGAAACTTTAAGTTTCCCAATTTTTG 

TGATGATTTGGAACATGCGATTAAACTCTCTTTAGAAACTTTAAGTTTCCCAATTTTTG 

ATGCGATTAAACTCTCTTTAGAACCTTTAAGTTTCCCAATTTTTG 

TGATGATTTGGAACATGCGATTAAACTCTCTTTAGAAACTTTAAGTTTCCCAATTTTTG 

AAAACCGGCTAATATGGGGTCATCAGTAGGTATTTCAAAAGCGACAGATGAATCCTCAC

AAACCGGGC 

-TCGCTCTGCGGAACGTGAAGTATCTGTACTG- TCTGCAGAAA-GCGT 

AACGTGAAGTATCTGTACTGCTCTGCAGAAAAGCGT 

CTAATATGGGGTCATCAGTAGGTATTTCAAAAGCGACAGATGAATCCTCAC 

ATCTGTACTG - TCTGCAGAAAAGCGT 

AAAACCGGCTAATATGGGGTCATCAGTAGGTATTTCAAAAGCGACAGATGAATCCTCAC 

- -TCTGTACTG- TCTGCAGAAA-GCGT 

AAAACCGGCTAATATGGGGTCATCAGTAGGTATTTCAAAAGCGACAGATGAATCCTCAC 

CGTGAAGTATCTGTACTG- TCTGCAGAAA-GCGT 

AAACCCGGCTAATATGGGGTCATCAGTAGGTATTTCAAAAGCGACAGATGAATCCTCAC

GT 

AAAACCGGCTAATATGGGGTCATCAGTAGGTATTTCAAAAGCGACAGATGAATCCTCAC

TGGTCGCTCTGCGGAACGTGAAGTATCTGTACTG- TCTGCAGAAA- GCGT 

AAGCAGGGGATACATTGACCACTGAGTAAAACCGGGC 



SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 



TCGCTCTGCAATTGACTTAGCTCTCAAGTATGATAGCCGTATTTTGATTGAACAAGGCG 
TTGT - ATTCAGTTCGTTTAAGAAGACTTGTCCATCTTTCGTCAAAAAGAAATCACAGCG 
ATGC-GTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATTTTATCACGCAAGTAG 

ATGC-GTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATTTTATCACGCAAGTAG
TCGCTCTGCAATTGACTTAGCTCTCAAGTATGATAGCCGTATTTTGATTGAACAAGGCG 

TTGACTTAGCTCTCAAGTATGATAGCCGTATTTTGATTGAACAAGGCG 

ATGC-GTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATTTTATCACGCAAGTAG
TCGCTCTGCAATTGACTTAGCTCTCAAGTATGATAGCCGTATTTTGATTGAACAAGGCG 

SEQ1610 ATGC-GTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATTTTATCACGCAAGTAG

SEQ1611 TCGCTCTGCAATTGACTTAGCTCTCAAGTATGATAGCCGTATTTTGATTGAACAAGGCG

SEQ1612 ATGC-GTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATTTTATCACGCAAGTAG

SEQ1613 TCGCTCTGCAATTGACTTAGCTCTCAAGTATGATAGCCGTATTTTGATTGAACAAGGCG

SEQ1614 ATGCCGTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATTTTATCACGCAAGTAG

SEQ1615 TCGCTCTGCAATTGACTTAGCTCTCAAGTATGATAGCCGTATTTTGATTGAACAAGGCG

SEQ1616 ATGC-GTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATTTTATCACGCAAGTAG

SEQ1617 TTGT-ATTCAGTTCGTTTAAGAAGATCTGTCCATCTTTCGTCAAAAAGAAATCACAGCG 

SEQ1601 GACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAAGACAACTTTTC

SEQ1602 GATAAAC CACAAGC CCCGATTGCTTTAAAAGCTTTACTTGCATATTGACGCATTG

SEQ1603 GTCAATTTATTAAA ACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTA

SEQ1604

SEQ1605 GTCAATTTATTAAA ACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAA

SEQ1606 GACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAAGACAACTTTTC

SEQ1607 GACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAAGACAACTTTTC

SEQ1608 GTCAATTTATTAAA ACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTA

SEQ1609 GACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAAGACAACTTTTC

SEQ1610 GTCAATTTATTAAA ACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTA

SEQ1611 GACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAAGACAACTTTTC

SEQ1612 GTCAATTTATTAAA ACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTA

SEQ1613 GACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAAGACAACTTTTC

SEQ1614 GTCAATTTATTAAA ACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTA

SEQ1615 GACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAAGACAACTTTTC

SEQ1616 GTCAATTTATTAAA ACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTA

SEQ1617 GATAAAC CACAAGC CCCGATTGCTTTAAAAGCTTTACTTGCATATTGACGCATTG

SEQ1601 TGGCGAAGTTGTTAAAGACGTCGATTTCTATGA--CTATGACGCCAAAT-ATATTGATA

SEQ1602 TTCCATAGTT GCTTCATCAACTTTAGCTGGAATATCCATAGTAATTTTATTATCA

SEQ1603 TGACAAACC AAACTGTTGATTTAGACAAAATGGTTCGTCCAAGTGATATCTATG

SEQ1604 CGTCGATTTCTATGA--CTATGACGCCAAAT-ATATTGATA

SEQ1605

SEQ1606 TGGCGAAGTCGTTAAAGACGTCGATTTCTATGA--CTATGACGCCAAAT-ATATTGATA

SEQ1607 TGGCGAAGTCGTTAAAGACGTCGATTTCTATGA--CTATGACGCCAAAT-ATATTGATA

SEQ1608 TGACAAACC AAACTGTTGATTTAGACAAAATGGTTCGTCCAAGTGATATCTATG

SEQ1609 TGGCGAAGTCGTTAAAGACGTCGATTTCTATGA--CTATGACGCCAAAT-ATATTGATA

SEQ1610 TGACAAACC AAACTGTTGATTTAGACAAAATGGTTCGTCCAAGTGATATCTATG

SEQ1611 TGGCGAAGTCGTTAAAGACGTCGATTTCTATGA--CTATGACGCCAAAT-ATATTGATA

SEQ1612 TGACAAACC AAACTGTTGATTTAGACAAAATGGTTCGTCCAAGTGATATCTATG

SEQ1613 TGGCGAAGTTGTTAAAGACGTCGATTTCTATGA--CTATGACGCCAAAT-ATATTGATA

SEQ1614 TGACAAACC AAACTGTTGATTTAGACAAAATGGTTCGTCCAAGTGATATCTATG

SEQ1615 TGGCGAAGTCGTTAAAGACGTCGATTTCTATGA--CTATGACGCCAAAT-ATATTGATA

SEQ1616 TGACAAACC AAACTGTTGATTTAGACAAAATGGTTCGTCCAAGTGATATCTATG

SEQ1617 TTCCATAGAT GCTTCATCAACTTTAGCTGGAATATCCATAGCAATTTTATTATCA

SEQ1601 TAAAATTACTAT--GGATATTCCAGCTAAAGTTGATGAAGCAACTATGGAAGCAATGCG

SEQ1602 TATATTTGGCGTCATAGTCATAGAAATCGACGTCTTTAACGACTTCGCCAGG--AAAAG

SEQ1603 TGATAATGCAAT--TGTTTTCCCCGTTTTAC ATGGACC AATGGGGGAAG--ATGGT

SEQ1604 TAAAATTACTAT--GGATATTCCAGCTAAAGTTGATGAAGCAACTATGGAAGCAATGCG

SEQ1605

SEQ1606 TAAAATTACTAT--GGATATTCCAGCTAAAGTTGATGAAGCAACTATGGAAGCAATGCG

SEQ1607 TAAAATTACTAT--GGATATTCCAGCTAAAGTTGATGAAGCAACTATGGAAGCAATGCG

SEQ1608 TGATAATGCAAT--TGTTTTCCCCGTTTTAC ATGGACC AATGGGGGAAG--ATGGT

SEQ1609 TAAAATTACTAT--GGATATTCCAGCTAAAGTTGATGAAGCAACTATGGAAGCAATGCG

SEQ1610 TGATAAT

SEQ1611 TAAAATTACTAT--GGATATTCCAGCTAAAGTTGATGAAGCAACTATGGAAGCAATGCG

SEQ1612 TGATAATGCAAT--TGTTTTCCCCGTTTTAC ATGGACC AATGGGGGAAG--ATGGT

SEQ1613 TAAAATTACTAT--GGATATTCCAGCTAAAGTTGATGAAGCAACTATGGAAGCAATGCG

SEQ1614 TGATAATGCAAT--TGTTTTCCCCGTTTTAC ATGGACC AATGGGGGAAG--ATGGT

SEQ1615 TAAAATTACTAT--GGATATTCCAGCTAAAGTTGATGAAGCAACTATGGAAGCAATGCG

SEQ1616 TGATAATGCAAT--TGTTTTCCCCGTTTTAC ATGGACC AATGGGGGAAG--ATGGT

SEQ1617 TATATTTGGCG



SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 

SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 

SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



CAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTT 
T-- GTCTTAACATCATTATTGCCTAAAATACCTACTTCAATTTCACGAGCTGTCACGCC 
C- -TATCCAAG - -GATTTTTAGAAGTTTTAAGGATGCCTTATGTTGGGACTAATATTCT 
CAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTT 



CAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTT 
CAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTT 
C - - TATCCAAG - - GATTTTTAGAAGTTTTAAGGATGCCTTATGTTGGGACTAATATTCT 
CAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTT 



CAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTT 
C- -TATCCAAG- -GATTTTTAGAAGTTTTAAGGATGCCTTATGTTGGGACTAATATTCT 
CAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTT 
C- -TATCCAAG- -GATTTTTAGAAGTTTTAAGGATGCCTTATGTTGGGACTAATATTCT 
CAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTT 
C- -TATCCAAG- -GATTTTTAGAAGTTTTAAGGATGCCTTATGTTGGGACTAATATTCT 

TTTTTGACGAAAGA- - TGGACAAATCTTCTTAAACGAACTGAA- TACAATGCCCGGTTT 
TGTTCAATCAAAATACGGCTATCATACTTGAGAGCTAAGTCAATKSCAGAGCGAAGTGA 

TCTTCAAGCGTGGCTAT 

TTTTTGACGAAAGA- - TGGACAAATCTTCTTAAACGAACTGAA- TACAATGCCC 



TTTTTGACGAAAGA- - TGGACAAATCTTCTTAAACGAACTGAA- TACAATGCCCGGTTT 
TTTTTGACGAAAGA- - TGGACAAATCTTCTTAAACGAACTGAA -TACAATGCCCGGTTT 

TCTTCAA 

TTTTTGACGAAAGAA - TGGACAAATCTTCTTAAACGAACTGAAATAC 



TTTTTGACGAAAGA- - TGGACAAATCTTCTTAAACGAACTGAA- TACAATGCCCGGTTT 

TCTTCAAGCGTGGCTATGGATAAAATTACAACAAAACAAGTCCTTGCAACAGTAG 

TTTTTGACGAAAGA - -TGGACAAATCTTCTTAAACGAACTGAA- TACAATGCCCGGTTT 
TCTTCAAGCGTGGCTATGGATAAAATTACAACAAAACAAGTCCTTGCAACAGTAGGTGT 
TTTTTGACGAAAGA- - TGGACAAATCTTCTTAAACGAACTGAA- TACAATGCCCGGTTT 
TCTTCAAGCGTGGCTATGGATAAAATTACAACAAAACAAGTCCTTGCAACAGTAGG 



ACTCAGTGGTCAATGTATCCTCTGCTTTGGGAAAA-TATGGGGCTAACTTATAGTGATT
GATTCATCTGTCGCTTTTGAAATACCTACTGATGACCCCATATTAGCCGGTTTTACAAA 



ACTCAGTGGTCAATGTATCCTCTGCTTTGGGAAAA- T 

ACTCAGTGGTCAATGTATCCCCTGCTTTGGGAAAAGTATGGGGCTAACCTT



ACTCAGTGGTCAATGTATCCCCTGCTTTGGGAAAA-TATGGGGCTAACTTATAG



ACTCAGTGGTCAATGTATCCTCTGCTTTGGGAAAA-TATGGGGCTAACTT 

CCTCAGG 

ACTCAGTGGTCAATGTATCCCCTGCTTTGGGAAAA - TATGGGGCTAACTTATAGTGA - - 



SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616
SEQ1617 

SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 

SEQ1601 
SEQ1602 
SEQ1603
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



GATTG 

ATTGGGAAACTTAAAGTTTCTAAAGAGAGTTTAATCGCATGTTCCAAATCATCACCCTC 

AAATAAGTTTGATATGCAACCTGAGGTACACCTACTGTTGCAAGGACTTGTTTTGTTGT 

ATTTTATCCATAGCCACGCTTGAAGATAGAATATTAGTCCCAACATAAGGCATCCTTAA



SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



ACTTCTAAAAATCCTTGGATAGAACCATCTTCCCCCATTGGTCCATGTAAAACGGGGAA



SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



ACAATTGCATTATCATCATAGATATCACTTGGACGAACCATTTTGTCTAAATCAACAGT 



SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



TGGTTTGTCATTAACTTTTCATCTGAAGATGGCATTTCATCAAATTCTTGTGTTTTAAT 



SEQ1601 

SEQ1602 AATTGACCTACTTGCGTG

SEQ1603 

SEQ1604 

SEQ1605 

SEQ1606 

SEQ1607 

SEQ1608 

SEQ1609 

SEQ1610 

SEQ1611 

SEQ1612 

SEQ1613 

SEQ1614 

SEQ1615 

SEQ1616 

SEQ1617 
Table 17: Comparative Sequences relating to SAG1086 (xanthine phosphoribosyltransferase)



SEQ ID NO. 1701: SAG1086 FROM THE 1169NT1 GBS NONTYPEABLE STRAIN

TTTAAAGGTTGATTCCTTTTTGACTCATCAGGTAGATTTTGAGTTAATGCAGGAAAT^ 
CCGGCATTACGAAGGTTGTTACGATTGAAGCATCTGGAATTGCGCCAGCAGTGTACGCAGCTC^ 
GCTAAAAAGGCTAAGAAGATTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCT 
TATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACT 

AAATTATTGGTCAAGCTGGAGCTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCCAA 
ACAGGTGTTCCAGT 

SEQ ID NO. 1702: SAG0767 FROM THE 18RS21 GBS TYPE II STRAIN 

ATAAATATAAAGAAGCCGGCATTACGAAGGTTGTTACGATT 

GkACCAATGATATTTGCTAAAAAAGCTAAGAACATTACTATGACTGAAGGTATCTTAACTG^ 
TACGAGTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTG^ 

CTAAAGGATTACTTGAAATTATTGGTCAAGCTGGAGCTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCC^AGATGGGCGT 
GATTTGTTAGAAAAAACA

SEQ ID NO. 1703: SAG0767 FROM THE H36b GBS TYPE Ib STRAIN

AAGAACGTATTCTTAAAGATGGTGATGTTTTAGGTGAGAACATTTT 
ATGCAGGAAATAGGTAAAGTTTTTGCTGATAAATATAAAGAAGCCGGCATT^ 

AGCAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAACATTACTATG 
CTGAAGTGTATTCTTTTACAAAGCAAGTTACGAGTCAAGTTTCTATTGTGAGTCGCTTTTTATCT^ 
GATGACTTTTTAGCAAACGGTCAAGCGGCTAAAGGATTACTTGAAATTATTGGTCAAGCTGGAGCTAA 
TATTGAAAAATCTTTCCAAGATGGGCGTGATT 

SEQ ID NO. 1704: SAG0767 FROM THE M732 GBS TYPE III STRAIN

AGGTTGTTACAATTGAAGCATCTGGAATTGCGCCAGCAGTGTACGCAGCTCAA^ 

AAGAACATTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGC^ 

CTTTTTATCTAACGATGATACTGTACTCATCATTGATGACTTTTTAGCAAACGGTCAAGCG^ 

AAGCTGAAGCTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCCAAGATGGGCGTGATTTGTTAGAAAAAACAGGTG 
GTTACTTCTCTTGCTCGT 

SEQ ID NO. 1705: SAG0767 FROM THE M781 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

GAACGTATTCTTAAAGATGGTGATGTTTTAGGTGAGAACATTTTAAAAGTTGATTCT^ 
GCAGGAAATAGGTAAAGTTTTTGCTGATAAA.TA 

CAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAAC^TTA 

GAAGTGTATTCTTTTACAAAGCAAGTTACGAGTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTC^T(^TTGA 
TGACTTTTTAACAAACGGTCAAGC 

SEQ ID NO. 1706: SAG0767 FROM THE 090 GBS TYPE Ia STRAIN (REVERSE COMPLEMENT)

ACATTTTAAAGGTTGATTCTTTTTTGACTCATCAGGTAGATTTTGAGTTAATGCAGGAAATAGGTAAAGTTTTTGCT 

GAAGCCGGCATTACGAAGGTTGTTACGATTGAAGCATCTGGAATTGCACCA^ 

ATTTGCTAAAAAAGCTAAGAACATTACTATGACTGAAGGTATCTTAACTGCT 

TTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATGACTTTTTAGCAAACMGTCYAGCGGCTAAAGGATTA 

CTTGAAATTATTGGTCAAGCTGGAGCTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCT^ 

AAA 

SEQ ID NO. 1707: SAG0767 FROM THE A909 GBS TYPE Ia STRAIN (REVERSE COMPLEMENT)

ACGTATTCTTAAAGATGGTGATGTTTTAGGTGAGAACATTTTAAAAGTTGATTCTTTTTTGACTCATCAGGTAGATTTTGAGTTAATGC 
AGGAAATAGGTAAAGTTTTTGCTGATAAATATAAAGAAGCCGGCATTACGAAGGT 
- GTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAGCTA 

AGTGTATTCTTTTACAAAGCAAGTTACGAGTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG 
ACTTTTTAGCAAACGGKCAAGCGGSTAAAGGATTACTTGAAATTATTGGTCAAGCTGGAGCTA 

SEQ ID NO. 1708: SAG0767 FROM THE COH1 GBS TYPE Ia STRAIN

TTTAAAAGTTGATTCTTTTTTGACTCATCAGGTAAATTTTGAGTTAATGCAGGAAATAGGTAAAGTTTTTGCTGATAAATATAAAGAAG 
CCGGCATTACGAAGGTTGTTACAATTGAAGCATCTC 

GCTAAAAAAGCTAAGAACATTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTAT^ 
TATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATGACT^ 

AAATTATTGGTCAAGCTGAAGCTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCCAAGATGGGCGTGATTTGTTA 
ACAGGTGTTCCGGTTAC 

SEQ ID NO. 1709: SAG0767 FROM THE CJB110 GBS NONTYPEABLE STRAIN (REVERSE COMPLEMENT)

GCTGATAAATATAAAGAAGCCGGCATTACGAAGGTTGTTACAATTGA 

GGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAACATTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGC
AAGTTACGAGTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATGACTTTTTAGCAAACGGTCA^ 
GCGGCTAAAGGATTACTTGAAATTTATTGGTCAAGCTGGAGCTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCCAAGATG 
GGCGTGATTTGTTAGAAAAAACAGGTGTTCCAGT



SEQ ID NO. 1710: SAG0767 FROM THE 2603 V/R GBS TYPE V STRAIN

AACGTATTCTTAAAGATGGTGATGTTTTAGGTGAGAACATTTTAAAAGTTGATTCTTTTTTGACTra 

CAGGAAATAGGTAAAGTTTTTGCTGATAAATATAAAGAAGCCGGCATTACGAAGGTTGTTACAATTGAAGCATCTGGAATTC 
AGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAG 

AAGTGTATTCTTTTACAAAGCAAGTTACGAGTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTC^ 

GACTTTTTAGCAAACGGTC^GCGGCTAAAGGATTACTTGAAATTATTGGTCAAGCTGGAGCTAAGGTTGCTGGTATCGGAATCGTTAT 

TGAAAAATCTTTCCAAGATGGGCGTGATTTGTTAGAAAAAACAGGTGTTCCAG 

SEQ ID NO. 1711: SAG0767 FROM THE JM9130013 GBS TYPE VIII STRAIN (REVERSE COMPLEMENT)

ACGAAGGTTGTTACAATTGAAGCATCTGGAATTG 

AGCTAAGAACATTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGCAAGTTACGAGTCAAGTTTCTATTGTGA
GTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATGACTTTTTAGCAAACGGTCAAGCGGCTAAAGGATTACTTGAAATTATT 
GGTCAAGCTGGAGCTAAGGTTGCTGGTATCGGA 



SEQ1701 TTTAAAGGTTGATTCCT

SEQ1702 TTTAGGTGAGAACATTTTAAAGGTTGATTCTT

SEQ1703 AGAACGTATTCTTAAAGATGGTGATGTTTTAGGTGAGAACATTTTAAAAGTTGATTCTT

SEQ1704 ATTCT

SEQ1705 -GAACGTATTCTTAAAGATGGTGATGTTTTAGGTGAGAACATTTTAAAAGTTGATTCTT

SEQ1706 ACATTTTAAAGGTTGATTCTT

SEQ1707 ACGTATTCTTAAAGATGGTGATGTTTTAGGTGAGAACATTTTAAAAGTTGATTCTT

SEQ1708 TTTAAAAGTTGATTCTT

SEQ1709

SEQ1710 --AACGTATTCTTAAAGATGGTGATGTTTTAGGTGAGAACATTTTAAAAGTTGATTCTT

SEQ1711

SEQ1701 TTTGACTCATCAGGTAGATTTTGAGTTAATGCAGGAAATAGGTAAAGTTTTTGCTGATA

SEQ1702 TTTGACTCATCAGGTAGATTTTGAGTTAATGCAGGAAATAGGTAAAGTTTTTGCTGATA

SEQ1703 TTTGACTCATCAGGTAGATTTTGAGTTAATGCAGGAAATAGGTAAAGTTTTTGCTGATA

SEQ1704 TTTTGACTATCAGGTAAATTTTGAGTTAATGCAGGAAATAGGTAAAGTTTTTGCTGATA

SEQ1705 TTTGACTCATCAGGTAAATTTTGAGTTAATGCAGGAAATAGGTAAAGTTTTTGCTGATA

SEQ1706 TTTGACTCATCAGGTAGATTTTGAGTTAATGCAGGAAATAGGTAAAGTTTTTGCTGATA

SEQ1707 TTTGACTCATCAGGTAGATTTTGAGTTAATGCAGGAAATAGGTAAAGTTTTTGCTGATA

SEQ1708 TTTGACTCATCAGGTAAATTTTGAGTTAATGCAGGAAATAGGTAAAGTTTTTGCTGATA

SEQ1709 GCTGATA

SEQ1710 TTTGACTCATCAGGTAGATTTTGAGTTAATGCAGGAAATAGGTAAAGTTTTTGCTGATA

SEQ1711

SEQ1701 ATATAAAGAAGCCGGCATTACGAAGGTTGTTACGATTGAAGCATCTGGAATTGCGCCAG

SEQ1702 ATATAAAGAAGCCGGCATTACGAAGGTTGTTACGATTGAAGCATCTGGAATTGCACCAG

SEQ1703 ATATAAAGAAGCCGGCATTACGAAGGTTGTTACAATTGAAGCATCTGGAATTGCGCCAG

SEQ1704 ATATAAAGAAGCCGGCATTACGAAGGTTGTTACAATTGAAGCATCTGGAATTGCGCCAG

SEQ1705 ATATAAAGAAGCCGGCATTACGAAGGTTGTTACAATTGAAGCATCTGGAATTGCGCCAG

SEQ1706 ATATAAAGAAGCCGGCATTACGAAGGTTGTTACGATTGAAGCATCTGGAATTGCACCAG

SEQ1707 ATATAAAGAAGCCGGCATTACGAAGGTTGTTACAATTGAAGCATCTGGAATTGCGCCAG

SEQ1708 ATATAAAGAAGCCGGCATTACGAAGGTTGTTACAATTGAAGCATCTGGAATTGCGCCAG

SEQ1709 ATATAAAGAAGCCGGCATTACGAAGGTTGTTACAATTGAAGCATCTGGAATTGCGCCAG

SEQ1710 ATATAAAGAAGCCGGCATTACGAAGGTTGTTACAATTGAAGCATCTGGAATTGCGCCAG

SEQ1711 ACGAAGGTTGTTACAATTGAAGCATCTGGAATTGCGCCAG

SEQ1701 CAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAGGCTAAGAACA

SEQ1702 CAGTGTACGCAGCTCAAGCATTGGGCGKACCAATGATATTTGCTAAAAAAGCTAAGAACA

SEQ1703 CAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAACA

SEQ1704 CAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAACA

SEQ1705 CAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAACA

SEQ1706 CAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAACA

SEQ1707 CAGTGTACGCAGCTCAAGCATTGGGCGTACGAATGATATTTGCTAAAAAAGCTAAGAACA

SEQ1708 CAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAACA

SEQ1709 CAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAACA

SEQ1710 CAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAACA

SEQ1711 CAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAACA






WO 2004/018646 



PCT/US2003/026827 






SEQ1701 TTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGCAAGWTACGA

SEQ1702 TTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGCAAGTTACGA

SEQ1703 TTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGCAAGTTACGA

SEQ1704 TTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGCAAGTTACGA

SEQ1705 TTACTATGACTGAAGGTATRTTAACTGCTGAAGTGTATTCTTTTACAAAGCAAGTTACGA

SEQ1706 TTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGCAAGTTACGA

SEQ1707 TTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGCAAGTTACGA

SEQ1708 TTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGCAAGTTACGA

SEQ1709 TTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGCAAGTTACGA

SEQ1710 TTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGCAAGTTACGA

SEQ1711 TTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGCAAGTTACGA

SEQ1701 GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG

SEQ1702 GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG

SEQ1703 GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG

SEQ1704 GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG

SEQ1705 GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG

SEQ1706 GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG

SEQ1707 GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG

SEQ1708 GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG

SEQ1709 GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG

SEQ1710 GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG

SEQ1711 GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG

SEQ1701 ACTTTTTAGCAAACGGTCAAGCGGCTAAAGGATTACTTGAAATT-ATTGGTCAAGCTGGA

SEQ1702 ACTTTTTAGCAAACGGTCAAGCGGCTAAAGGATTACTTGAAATT-ATTGGTCAAGCTGGA

SEQ1703 ACTTTTTAGCAAACGGTCAAGCGGCTAAAGGATTACTTGAAATT-ATTGGTCAAGCTGGA

SEQ1704 ACTTTTTAGCAAACGGTCAAGCGGCTAAAGGATTACTTGAAATT-ATTGGTCAAGCTGAA

SEQ1705 ACTTTTTAACAAACGGTCAAGC

SEQ1706 ACTTTTTAGCAAACMGTCYAGCGGCTAAAGGATTACTTGAAATT-ATTGGTCAAGCTGGA

SEQ1707 ACTTTTTAGCAAACGGKCAAGCGGSTAAAGGATTACTTGAAATT-ATTGGTCAAGCTGGA

SEQ1708 ACTTTTTAGCAAACGGTCAAGCGGCTAAAGGATTACTTGAAATT-ATTGGTCAAGCTGAA

SEQ1709 ACTTTTTAGCAAACGGTCAAGCGGCTAAAGGATTACTTGAAATTTATTGGTCAAGCTGGA

SEQ1710 ACTTTTTAGCAAACGGTCAAGCGGCTAAAGGATTACTTGAAATT-ATTGGTCAAGCTGGA

SEQ1711 ACTTTTTAGCAAACGGTCAAGCGGCTAAAGGATTACTTGAAATT-ATTGGTCAAGCTGGA

SEQ1701 CTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCCAAGATGGGCGTGATTTG

SEQ1702 CTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCCAAGATGGGCGTGATTTG

SEQ1703 CTAAGGTTGCTGGTATCGGAATCYTTATTGAAAAATCTTTCCAAGATGGGCGTGATT--

SEQ1704 CTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCCAAGATGGGCGTGATTTG

SEQ1705

SEQ1706 CTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCCAAGATGGGCGTGATTTG

SEQ1707 CTA

SEQ1708 CTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCCAAGATGGGCGTGATTTG

SEQ1709 CTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCCAAGATGGGCGTGATTTG

SEQ1710 CTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCCAAGATGGGCGTGATTTG

SEQ1711 CTAAGGTTGCTGGTATCGGA

SEQ1701 TAGAAAAAACAGGTGTTCCAGT

SEQ1702 TAGAAAAAACA

SEQ1703

SEQ1704 TAGAAAAAACAGGTGTTCCGGTTACTTCTCTTGCTCGT

SEQ1705

SEQ1706 -TAGAAAA

SEQ1707

SEQ1708 TAGAAAAAACAGGTGTTCCGGTTAC

SEQ1709 TAGAAAAAACAGGTGTTCCAGT

SEQ1710 TAGAAAAAACAGGTGTTCCAG

SEQ1711






Table 18: Comparative Sequences relating to SAG1600 (glutamate racemase) 



SEQ ID NO. 1801: SAG1600 FROM THE H36b GBS TYPE Ib STRAIN (REVERSE COMPLEMENT)

AATCTTCATTGGAGATCAC^CTAGAGCTCCGTATGGTCCT 

TATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTTC 

CCTGTTTTAGGCGTTATTTTACCAGGAGCTAGCGCAGCTATCAAATCAAC^ 

TGTTAAAT(^GATGCTTATCGTCAAAAAATTCAAG 

TTGTGGAATCAAATCAGATGTCTTCTAGTTTAGCCAAAAAGGTGGTTTATGAAACGTTGTCCCCATTAGTTGGTAAATTAGATACT 

ATTTTAGGTTGCACGCATTATCCCTTATTACGTCCCAT<^^ 

AACCGTTCGTG&TATTTCTGTTTTATTGAACTAT^ 

CCGCCAGCCCAA 

SEQ ID NO. 1802: SAG1600 FROM THE M732 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

AAATGTTCCGTCAACTTCCAGAAGAGGAAGTAATCTTCM^ 

AGAGAGTTTACCTGGCAGATGGTTAACTTCTTATTGACTAAAAATGTTAAGATGATTGTTAT^^ 
CTGGCAAGAAATTAAAGAAAAACTAGACATCCCTGTTT^ 

GGAAAGTTGGTATTATAGGTACTCCCATGACTGTTAAATCAGATGCTTATCGTCAAAAAATTCA^ 

GTATCCCTTGCTTGTCCGAAATTTGTTCCAATTGTGGAATCAA^ 

GTCCCCATTAGTTGGTAAATTAGATACTTTAATTTTAGGTTC

CTGAGGTTAAATTAATTG^TAGTGGCGCAGAAACCGTTCGTGATATTTCTGTTTTATTGAACTATTTTGAGATAA 
AATAAACACGGTGGTC^TCACTTTTACACMCCGCC^GCCCAAAAGGTTTTAAAGAAA 

SEQ ID NO. 1803: SAG1600 FROM THE 090 GBS TYPE Ia STRAIN

AATCTTCATTGGAGACCAGGCTAGAGCTCCGTATGGTCCTAGAC 

ATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTTGCCTGGGAA 
CTGTTTTAGGCGTTATTTTACCAGGAGCTAGCGCAGCTATCAAATCAACTAATTCAGGGA 

GTTAAATCAGATGCTTATCGTCAAAAAATTCAAGCTTTGTCTCCAAATACTGCTGTGGTATCCCTTGCTTGTCCGAAATTTGTTC 
TGTGGAATCAAATCAGATGTCTTCTAGTTTAGCCAAAAAGGTGGTTTATGAAACGCTG 

TTTTAGGTTGCACGCATTATCCCTTATTACGTCCCATCATTCAAAATGTTATGGGGGCTGAGGTTAAATTAATTGATAGTGGCGC^ 

ACCGTTCGTGATATTTCTGTTTTATTGAACTATTTTGAGATaAmCCATaATTGGsraAAATAAAGACGG 

CGsCAGCCCAAAAGGTTTTTAAGGAAATTGCAGAACAATGGCTTAATCAAGAAATAAAT 



SEQ ID NO. 1804: SAG1600 FROM THE A909 GBS TYPE Ia STRAIN

GCGGTTGTGTAAAAGTGATGACCACCGTGTTTATTTTGCCAATTATGGTTTATC^ 
GGTTTCTGCGCCLACTATGAATTAATTTAACCTCAGCCCCGATAACATTTTGAATGATGGG 
AAATTAAAGTATCTAATTTACCAACTAATGGGGA 
ACAATTGGAACAAATTTCGGACAAGCAAGGGATACCACAGC^ 

AACAGTCATGGGAGTACCTATAATACCAACTTTCCCTAAATTAGTTGATTTGATAGCTGCGCTAGCTCCTGGTAAAATAACGCCTAAAA 
(^GGGATGTCTAGTTTTTCTTTAATTTCTTGCCAGGCAACTG 
AATAAGAAGTTAACCATCTGCCAGGTAAACTCTCTAATCTGTT^ 
GATTACTTCCTCTTCTGGAAGTTGACGGAACATTTCCTTAACAACCGTTAAACCACCT 

SEQ ID NO. 1805: SAG1600 FROM THE COH1 GBS TYPE Ia STRAIN

TTCCGTCAACTTCCAAAATATGAAGTAATCTTCAT^ 

GTTTACCTGGCAGATGGTTAACTTCTTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATAQ^GCAACTGCAGTTGCCTGGC 

AAGAAATTAAAGAAAAACTAGACATCCCTGTTTTAGGCGTTATTTTACCAGGAGCTAGCGCAGCTATCA 

GTTGGTATTATAGGTACTCCCATGACTGTTAAATCAGATGCTTATC^ 

CCTTGCTTGTCCGAAAT 

SEQ ID NO. 1806: SAG1600 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

GTAATCTTCATTGGAGATCAGGCTAGAGCTCCGTATGGTCCTAGACCTGCTCAACAGATTAGAGAGTTTACCTGGCAGATGGTTAATTT 
CTTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTTGCCTGGCAAGAAATTAAAGAAAAA 

TAC 

SEQ ID NO. 1807: SAG1600 FROM THE 1169NT1 GBS TYPE V STRAIN

CTTTTGGGCTGGCGGTTGTGTAAAATTGATGACCACCGT 

ATATCACGAACGGTTTCTGCGCCACTATCAATTAATTTAACCTCAGCCCCCATAACATTTT 

CGTGCAACCTAAAATTAAAGTATCTAATTTACCAACTAATGGGGACAATGTTTCATAAACCACCTTTTTGGCTAAACTAGAAGACATCT

GATTTGATTCCACAATTGGAACAAATTTCGGACAAGCAAGGGATACCIAC^ 

GCATCTGATTTAACAGTCATGGGAGTACCTATAA

SEQ ID NO. 1808: SAG1600 FROM THE 1169NT1 GBS TYPE V STRAIN 

GTAATCTTCATTGGGGATCAGGCTAGAGCTCCGTATGGTCCTAGACCTGCTC^ 
CTTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTT 

SEQ ID NO. 1809: SAG1600 FROM THE 18RS21 GBS TYPE II STRAIN 

GAAATGTTCCGTCAACTTCCAGAAGAGGAAGTAATCTTCATTGGAGATCAGGCTAGAGCT^ 

TAGAGAGTTTACCTGGCAGATGGTTAACTTCTTATTGACTAAAAATGTTAAGAT<^ 

CCTGGCAAGAAATTAAAGAAAAACTAGACATCCCTGTTTTAGGCGTT^ 

GGGAAAGTTGGTATTATAGGTACTCCCATGACTGTTAAATCAGATGCTTATCGTCAAAAAATTCAAGC



SEQ ID NO. 1810: SAG1600 FROM THE 18RS21 GBS TYPE II STRAIN

ATTTCTTTAAAACCTTTTGGGCTGGCGGTTGTGTAATATTGATGACCACCGTGTTTATTTTGCCAATTATGGTTTATCTCAAAATAGTT 

CAATAAAACAGAAATATCACGAACGGTTTCTGCGCCACTATCAATTA^ 

ATATGGGATAATGCGTGCAACCTAAAATTAAAGTA 

SEQ ID NO. 1811: SAG1600 FROM THE 2603 V/R GBS TYPE V STRAIN 

ATTTCTTTAAAACCTTTTGGGCTGGCGGTTGTGTAATAAGTGATGACCACCGTGTTTATTTTGCCAATTATGGTTTATCTCAAAATAGT 
TCAATAAAAGAGAAATATCACGAACGGTTTCTGCGC 

AATAGGGGATAATGCGTGCAACCTAAAATTAAAGTATCTAATTTACCAACTAATGGGG^ 
ACTAGAAGACATCTGATTTGATTCCACAATTGGAACAA 

SEQ ID NO. 1812: SAG1600 FROM THE M781 GBS TYPE III STRAIN

GGCGGTTGTGTAAAAGTGATGACCACCGTGTTTATTTTGCCAATTATGGTTTATCTCAAAATAGTTCAATAAAACAGAAATAT 
CGGTTTCTGCGCCACTATGAATTAATT^ 

AAAATTAAAGTATCTAATTTACC^CTAATGGGGACAACGTTTCATAAACCACCTTTTTGGCTAAACTAGAAGA 

SEQ ID NO. 1813: SAG1600 FROM THE M781 GBS TYPE III STRAIN

AATCTTGATTGGAGATCAGGCTAGAGCTCCGTATGGTCCTAGAC^ 
TATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGC 

SEQ ID NO. 1814: SAG1600 FROM THE JM9130013 GBS TYPE VIII STRAIN

TGGGCTGGCGGTTGTGTAAAAGTGATGACCACCGTGTTTATT

CACGAACGGTTTCTGCGCCACTATCAATTAATTTAACCTCAGCCCCCAT^ 

CAACCTAAAATTAAAGTATCTAATTTACGAACTAATGGGGACAAC^ 

TGATTCCACAATTGGAACAAATTTCGGACAAGCAAGGGATACCA 

CTGATTTAACAGTCATGGGAGTACCTATAATACCAACTTTCCCTGAA 



SEQ1801 AATCTTCATTGGAGATCAGGCTAGAGCT

SEQ1802 AAATGTTCCGTCAACTTCCAGAAGAGGAAGTAATCTTCATTGGAGATCAGGCTAGAGCT

SEQ1803 AATCTTCATTGGAGACCAGGCTAGAGCT

SEQ1804 GCGGTTGTGTAAAAG-T

SEQ1805 TTCCGTCAACTTCCAAAATATGAAGTAATCTTCATTGGAGATCAGGCTAGAGCT

SEQ1806 GTAATCTTCATTGGAGATCAGGCTAGAGCT

SEQ1807 CTTTTGGGCTGGCGGTTGTGTAAAAT-T

SEQ1808 GTAATCTTCATTGGGGATCAGGCTAGAGCT

SEQ1809 AAATGTTCCGTCAACTTCCAGAAGAGGAAGTAATCTTCATTGGAGATCAGGCTAGAGCT

SEQ1810 ATTTCTTTAAAACCTTTTGGGCTGGCGGTTGTGTAATAT-T

SEQ1811 ATTTCTTTAAAACCTTTTGGGCTGGCGGTTGTGTAATAAGT

SEQ1812 GGCGGTTGTGTAAAAG-T

SEQ1813 AATCTTCATTGGAGATCAGGCTAGAGCT

SEQ1814 -TGGGCTGGCGGTTGTGTAAAAG-T

SEQ1801 CGTATGGTC-CTAGACCTGCTCAACAGATTAGAGAGTTTACCTGGCAGATGGTTAATTT

SEQ1802 CGTATGGTC-CTAGACCTGCTCAACAGATTAGAGAGTTTACCTGGCAGATGGTTAACTT

SEQ1803 CGTATGGTC-CTAGACCTGCTCAACAGATTAGAGAGTT-ACCTGGCAGATGGTTAATTT

SEQ1804 --GATGACCACCGTGTTTATTTTGCCAATTATGG--TTTATCTCA-AAATAGTTCA

SEQ1805 CGTATGGTC-CTAGACCTGCTCAACAGATTAGAGAGTTTACCTGGCAGATGGTTAACTT

SEQ1806 CGTATGGTC-CTAGACCTGCTCAACAGATTAGAGAGTTTACCTGGCAGATGGTTAATTT

SEQ1807 --GATGACCACCGTGTTTATTTTGCCAATTATGG--TTTATCTCA-AAATAGTTCA

SEQ1808 CGTATGGTC-CTAGACCTGCTCAACAGATTAGAGAGTTTACCTGGCAGATGGTTAATTT

SEQ1809 CGTATGGTC-CTAGACCTGCTCAACAGATTAGAGAGTTTACCTGGCAGATGGTTAACTT

SEQ1810 --GATGACCACCGTGTTTATTTTGCCAATTATGG--TTTATCTCA-AAATAGTTCA

SEQ1811 --GATGACCACCGTGTTTATTTTGCCAATTATGG--TTTATCTCA-AAATAGTTCA

SEQ1812 --GATGACCACCGTGTTTATTTTGCCAATTATGG--TTTATCTCA-AAATAGTTCA

SEQ1813 CGTATGGTC-CTAGACCTGCTCAACAGATTAGAGAGTTTACCTGGCAGATGGTTAACTT

SEQ1814 --GATGACCACCGTGTTTATTTTGCCAATTATGG--TTTATCTCA-AAATAGTTCA



SEQ1801 TTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTTGC

SEQ1802 TTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTTGC

SEQ1803 TTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTTGC

SEQ1804 --ATAAAACAGAAATATCACGAACGGT-TTCTGCGCCACTATCAATTAATTTAACCTCA

SEQ1805 TTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTTGC

SEQ1806 TTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTTGC

SEQ1807 --ATAAAACAGAAATATCACGAACGGT-TTCTGCGCCACTATCAATTAATTTAACCTCA

SEQ1808 TTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTT--

SEQ1809 TTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTTGC

SEQ1810 --ATAAAACAGAAATATCACGAACGGT-TTCTGCGCCACTATCAATTAATTTAACCTCA

SEQ1811 --ATAAAACAGAAATATCACGAACGGT-TTCTGCGCCACTATCAATTAATTTAACCTCA

SEQ1812 --ATAAAACAGAAATATCACGAACGGT-TTCTGCGCCACTATCAATTAATTTAACCTCA

SEQ1813 TTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGC

SEQ1814 --ATAAAACAGAAATATCACGAACGGT-TTCTGCGCCACTATCAATTAATTTAACCTCA

SEQ1801 TGGCAAGAAATTAAAGAAAAACTAGACGTGCCTGTTTTAGGCGTTATTTTACCAGGAGC

SEQ1802 TGGCAAGAAATTAAAGAAAAACTAGACATCCCTGTTTTAGGCGTTATTTTACCAGGAGC

SEQ1803 TGGCAAGAAATTAAAGAAAAACTAGACATACCTGTTTTAGGCGTTATTTTACCAGGAGC

SEQ1804 CCCCCATAACATTTTGAATGATGGGACGTAATAGGGGATAATGC-GTGCAACCTAAAAT

SEQ1805 TGGCAAGAAATTAAAGAAAAACTAGACATCCCTGTTTTAGGCGTTATTTTACCAGGAGC

SEQ1806 TGGCAAGAAATTAAAGAAAAACTAGACATAC

SEQ1807 CCCCCATAACATTTTGAATAATGGGACGTAATAGGGGATAATGC-GTGCAACCTAAAAT

SEQ1808

SEQ1809 TGGCAAGAAATTAAAGAAAAACTAGACATCCCTGTTTTAGGCGTTATTTTACCAGGAGC

SEQ1810 CCCCCATAACATTTTGAATGATGGGACGTAATATGGGATAATGC-GTGCAACCTAAAAT

SEQ1811 CCCCCATAACATTTTGAATGATGGGACGTAATAGGGGATAATGC-GTGCAACCTAAAAT

SEQ1812 CCCCCATAACATTTTGAATGATGGGACGTAATAGGGGATAATGC-GTGCAACCTAAAAT

SEQ1813

SEQ1814 CCCCCATAACATTTTGAATGATGGGACGTAATAAGGGATAATGC-GTGCAACCTAAAAT

SEQ1801 AGCGCAGCTATCAAATCAACTAATTCAGGGAAAGTTGGTATTATAGGTACTCCCATGAC

SEQ1802 AGCGCAGCTATCAAATCAACTAATTTAGGGAAAGTTGGTATTATAGGTACTCCCATGAC

SEQ1803 AGCGCAGCTATCAAATCAACTAATTCAGGGAAAGTTGGTATTATAGGTACTCCCATGAC

SEQ1804 AAAGTATCTAATTTACCAACTAATGGGGACAACGTTTCATAAACCACCTTTTTGGCTAA

SEQ1805 AGCGCAGCTATCAAATCAACTAATTTAGGGAAAGTTGGTATTATAGGTACTCCCATGAC

SEQ1806

SEQ1807 AAAGTATCTAATTTACCAACTAATGGGGACAATGTTTCATAAACCACCTTTTTGGCTAA

SEQ1808

SEQ1809 AGCGCAGCTATCAAATCAACTAATTTAGGGAAAGTTGGTATTATAGGTACTCCCATGAC

SEQ1810 AAAGTA

SEQ1811 AAAGTATCTAATTTACCAACTAATGGGGACAACGTTTCATAAACCACCTTTTTGGCTAA

SEQ1812 AAAGTATCTAATTTACCAACTAATGGGGACAACGTTTCATAAACCACCTTTTTGGCTAA

SEQ1813

SEQ1814 AAAGTATCTAATTTACCAACTAATGGGGACAACGTTTCATAAACCACCTTTTTGGCTAA

SEQ1801 GTTAAATCAGATGCTTATCGTCAAAAAATTCAAGCTTTGTCTCCAAATACTGCTGTGGT

SEQ1802 GTTAAATCAGATGCTTATCGTCAAAAAATTCAAGCTTTGTCTCCAAATACTGCTGTGGT

SEQ1803 GTTAAATCAGATGCTTATCGTCAAAAAATTCAAGCTTTGTCTCCAAATACTGCTGTGGT

SEQ1804 CTAGAAGACATCTGATTTGATTCCACAATTGGAACAAATTTCGGACAAGCAAGGGATAC

SEQ1805 GTTAAATCAGATGCTTATCGTCAAAAAATTCAAGCTTTGTCTCCAAATACTGCTGTGGT

SEQ1806

SEQ1807 CTAGAAGACATCTGATTTGATTCCACAATTGGAACAAATTTCGGACAAGCAAGGGATAC

SEQ1808

SEQ1809 GTTAAATCAGATGCTTATCGTCAAAAAATTCAAGC

SEQ1810

SEQ1811 CTAGAAGACATCTGATTTGATTCCACAATTGGAACAA

SEQ1812 CTAGAAGA

SEQ1813

SEQ1814 CTAGAAGACATCTGATTTGATTCCACAATTGGAACAAATTTCGGACAAGCAAGGGATAC



SEQ1801 TCCCTTGCTTGTCCGAAATTTGTTCCAATTGTGGAATCAAATCAGATGTCTTCTAGTTT

SEQ1802 TCCCTTGCTTGTCCGAAATTTGTTCCAATTGTGGAATCAAATCAGATGTCTTCTAGTTT

SEQ1803 TCCCTTGCTTGTCCGAAATTTGTTCCAATTGTGGAATCAAATCAGATGTCTTCTAGTTT

SEQ1804 ACAGCAGTATTTGGAGACAAAGCTTGAATTTTTTGACGATAAGCATCTGATTTAACAGT

SEQ1805 TCCCTTGCTTGTCCGAAAT-

SEQ1806

SEQ1807 ACAGCAGTATTTGGAGACAAAGCTTGAATTTTTTGACGATAAGCATCTGATTTAACAGT

SEQ1808

SEQ1809

SEQ1810

SEQ1811

SEQ1812

SEQ1813

SEQ1814 ACAGCAGTATTTGGAGACAAAGCTTGAATTTTTTGACGATAAGCATCTGATTTAACAGT

SEQ1801 GCCAAAAAGGTGGTTTATGAAACGTTGTCCCCATTAGTTGGTAAATTAGATACTTTAAT

SEQ1802 GCCAAAAAGGTGGTTTATGAAACGTTGTCCCCATTAGTTGGTAAATTAGATACTTTAAT

SEQ1803 GCCAAAAAGGTGGTTTATGAAACGCTGTCCCCATTAGTTGGTAAATTAGATACTTTAAT

SEQ1804 ATGGGAGTACCTATAATACCAACTTTCCCTAAATTAGTTGATTTGATAGCTGCGCTAGC

SEQ1805

SEQ1806

SEQ1807 ATGGGAGTACCTATAA-

SEQ1808

SEQ1809

SEQ1810

SEQ1811

SEQ1812

SEQ1813

SEQ1814 ATGGGAGTACCTATAATACCAACTTTCCCTGAA

SEQ1801 TTAGGTTGCACGCATTATCCCTTATTACGTCCCATCATTCAAAATGTTATGGGGGCTGA

SEQ1802 TTAGGTTGCACGCATTATCCCCTATTACGTCCCATCATTCAAAATGTTATGGGGGCTGA

SEQ1803 TTAGGTTGCACGCATTATCCCTTATTACGTCCCATCATTCAAAATGTTATGGGGGCTGA

SEQ1804 CCTGGTAAAATAACGCCTAAAACAGGGATGTCTAGTTTTTCTTTAATTTCTTGCCAGGC

SEQ1805

SEQ1806

SEQ1807

SEQ1808

SEQ1809

SEQ1810

SEQ1811

SEQ1812

SEQ1813

SEQ1814

SEQ1801 GTTAAATTAATTGATAGTGGCGCAGAAACCGTTCGTGATATTTCTGTTTTATTGAACTA

SEQ1802 GTTAAATTAATTGATAGTGGCGCAGAAACCGTTCGTGATATTTCTGTTTTATTGAACTA

SEQ1803 GTTAAATTAATTGATAGTGGCGCAGAAACCGTTCGTGATATTTCTGTTTTATTGAACTA

SEQ1804 ACTGCAGTTGCTGTATTACAAGCTATAACAATCATCTTAACATTTTTAGTCAATAAGAA

SEQ1805

SEQ1806

SEQ1807

SEQ1808

SEQ1809

SEQ1810

SEQ1811

SEQ1812

SEQ1813

SEQ1814



SEQ1801 TTTGAGATAAACCATAATTGGCAAAATAAACACGGTGGTCATCACTTTTACACAACCGC

SEQ1802 TTTGAGATAAACCATAATTGGCAAAATAAACACGGTGGTCATCACTTTTACACAACCGC

SEQ1803 TTTGAGATAAMCCATAATTGGSMAAATAAACACGGTGGTCATCACTTTTACACAACCGS

SEQ1804 TTAACCATCTGCCAGGTAAACTCTCTAATCTGTTGAGCAGGTCTAGGACCATACGGAGC

SEQ1805

SEQ1806

SEQ1807

SEQ1808

SEQ1809

SEQ1810

SEQ1811

SEQ1812

SEQ1813

SEQ1814

SEQ1801 AGCCCAA

SEQ1802 AGCCCAAAAGGTTTTAAAGAAA

SEQ1803 AGCCCAAAAGGTTTTTAAGGAAATTGCAGAACAATGGCTTAATCAAGAAATAAAT

SEQ1804 CTAGCCTGATCTCCAATGAAGATTACTTCCTCTTCTGGAAGTTGACGGAACATTTCCTT

SEQ1805

SEQ1806

SEQ1807

SEQ1808

SEQ1809

SEQ1810

SEQ1811

SEQ1812

SEQ1813

SEQ1814

SEQ1801

SEQ1802

SEQ1803

SEQ1804 ACAACCGTTAAACCACCT

SEQ1805

SEQ1806

SEQ1807

SEQ1808

SEQ1809

SEQ1810

SEQ1811

SEQ1812

SEQ1813

SEQ1814

Table 19: Comparative Sequences relating to SAG1680 (shikimate 5-dehydrogenase)



SEQ ID NO. 1901: SAG1680 FROM THE 2603 V/R GBS TYPE V STRAIN 

ATCCCTAGACCATTATAAGCATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTCTTTAGGCTTGTAAAC 
CAAGTCGACAACTACTAAATTCGGTGTTAAAATTTCTGGATCGTTAATTAAACTATAATTATCTAATGGCCTCATTCCTA 
AACTAGTAGCATCAATATAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAACGACT 
ACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTAATTTGAGCTGTTACGATTAAATAA
TCTAATTTCCGCAACTCCCTCCATAGCTGCTTGAACTGCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAATTA 
TTTTATTTTTAGCACTGAAACCTTGAGCTGCTAAAGCTTTAAAACAACCAATGCCATCTGTCATATGGCCTACTAAACGT 
CCGGTTCCACCTTGATTAACGATAGTATTTACAGCACCCACTAATTTAGCTTGAGGAGATAAATCATCTAGCAAAGGGAT
AACACTCTGTTTAAATGGCATTGAAACATTAACACCACGAATACCCAATGCCCTGACACCTCGAACAGCTTCTGTTAATT
TACCCTCTTCTACTTCAAATGTCAGATAGGCATAATTCATGTTTTTTTCTTGAAAAGAGGTATTCCACATTAACGGGGAT
AGAGAGTGGCGTGCAGG 

SEQ ID NO. 1902: SAG1680 FROM THE H36b GBS TYPE Ib STRAIN

GTTATTAATTGAAATGCTTCTGCTCCTTGATAAATCAGCATCCCTAGACCATTATAAGCATGTTTCACTCCATTTTGTCT
AACAAATCGTAACAATGCTGTTtCTTTAGGCTTGTAAACCAAGTCGACAACTACTAAATTCGGTGTTAAAATTTCTGGAT 
CGTTAATTAAACTATAATTATCTAATGGCCTCATTCCTAAACTAGTAGCATCAATATAAAAATGACTAGTTCTAATAGCG
TCTTTAAATGCTGTCTTATTTTCTAGATAATCAACGACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTC 
AATGACCTTATCGTAATTTGAGCTGTTACGATTAAATAATCTAATTTCCGCAACTCCCTCCATAGCTGCTTGAACTGCAA 
CTGCTTTACCTGAACCACCAATACCAGCTATTGTAATTATTTTATTTTTAGCACTGAAACCTTGAGCTGCTAAAGCTTTA 
AAACAACCAATGCCATCTGTCATATGGCCTACTAAACGTCCGGTTCCACCTTGATTAACGATAGTATTTACAGCACCCAC
TAATTTAGCTTGAGGAGATAAATCATCTAGCAAAGGGATAACACTCTGTTTAAATGGCATTGAAACATTAACACCACGAA 
TACCCAATGCCCTGACACCTCGAACAGCTTCTGTTAATTTACCCTCTTCTACTTCAAATGTCAGATAGGCATAATTCATG 
TTTTTTTCTTGAAAAGAGGTATTCCACATTAACGGGGATAGAGAGTGGCGTGCAGGA

SEQ ID NO. 1903: SAG1680 FROM THE M732 GBS TYPE III STRAIN 

CTGGTCTAATTGCCAATCCTGCACGCCACTCTCTATCCCCGTTAATGTGGAATACCTCTTTTCAAGAAAAAAACATGAAT 
TATGCCTATCTGACATTTGAAGTAGAAGAGGGTAAATTAACAGAAGCTGTTCGAGGTGTCAGGGCATTGAGTATTCGTGG 
TGTTAATGTTTCAATGCCATTTAAACAGAGTGTTATCCCTTTGCTAGATGATTTATCTCCTCAAGCTAAATTAGTGGGTG 
CTGTAAATACTATCGTTAATCAAGGTGGAACCGGACGTTTAGTAGGCCATATGACAGATGGCATTGGTTGTTTTAAAGCT 
TTAGCAGCTCAAGGTTTCAGTGCTAAAAATAAAATAATTACAATAGCTGGTATTGGTGGTTCAGGTAAAGCAGTTGCAGT 
TCAAGCAGCTATGGAGGGAGTTGCGGAAATTAGATTATTTAATCGTAACAGCTCAAATTACGATAAGGTCATTGACTTAT 
CAGATAAAATTAAAAAACAGTTTCAAATAAAGGTAGTCGTTGATTATCTAGAAAATAAGACAGCATTTAAAGACGCTATT 
AGAACTAGTCATTTTTATATTGATGCTACTAGTTTAGGAATGAGGCCATTAGATAATTATAGTTTAATTAACGATCCAGA 
TATTTTAACACCGAATTTAGTAGTTGTCGACTT 

SEQ ID NO. 1904: SAG1680 FROM THE M781 GBS TYPE III STRAIN 

AAATCAGCATCCCTAGACATTATAAGCATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTCTTTAGGCT 
TGTAAACCAAGTCGACAACTACTAAATTCGGTGTTAAAATTTCTGGATCGTTAATTAAACTATAATTATCTAATGGCCTC 
ATTCCTAAACTAGTAGCATCAATATAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATC 

TAAATAATCTAATTTCCGCAACTCCCTCCATAGCTGCTTGAACTGCAACTGCTTTACCTGAACCACCAATACCAGCTATT 
GTAATTATTTTATTTTTAGCACTGAAACCTTGAGCTGCTAAAGCTTTAAAACAACCAATGCCATCTGTCATATGGCCTAC 
TAAACGTCCGGTTCCACCTTGATTAACGATAGTATTTACAGCACCCACTAATTTAGCTTGAGGAGATAAATCATCTAGCA 
AAGGGATAACACTCTGTTTAAATGGCATTGAAACATTAACACCACGAATACTCAATGCCCTGACACCTCGAACAGCTTCT 

CGGGGATAGAGAGTGGCGTGCA 

SEQ ID NO. 1905: SAG1680 FROM THE 090 GBS TYPE Ia STRAIN

GTTCGAGGTGTCAGGGCATTGGGTATTCGTGGTGTTAATGTTTCAATGCCATTTAAACAGAGTGTTATCCCtTTGCTArA 
TGATTTATCTCCTCAAGCTAAATTAGTGGGTGCTGTAAATACTATCGTTAATCAAGGTGGAACCGsACGTTTAGTAGGCC 
ATATGACAGATGGCATTGGTTGTTTTAAAGCTTTAGCAGCTCAAGGTTTCAGTGCTAAAAATAAAATAGTTACAATAGCT
GGTATTGGTGGTTCAGGTAAAGCAGTTGCAGTTCAAGCAGCTATGGAGGGAGTTGCGGAAATTAGATTATTTAATCGTAA 
TAGCTCAAATTACGATAAGGTCATTGACTTATCAGATAAAATTAAAAAACAGTTTCAAATAAAGGTAGTCGTTGATTATC
TAGAAAATAAGACAGCATTTAAAGACGCTATTAGAACTAGTCATTTTTATATTGATGCTACTAGTTTAGGAATGArGCCA 
TTAGATAATTATAGTTTAATTAACGATCCAGAAATTTTAACACCCAATTTAGTAGTTGTCGACTTGGTTTACAAGCCTAA 
AGAAACAGCATTGTTACGATTTGTTAGACAAAATGGAGTGAAACATGCTTATAATGGTCTAGGGATGCTGATTTATCAAG 
GAGCAGA



SEQ ID NO. 1906: SAG1680 FROM THE A909 GBS TYPE Ia STRAIN

CCCTAGACCATTATAATCATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTCTTTAGGCTTGTAAACCA 
AGTCGACAACTACTAAATTCGGTGTTAAAATTTCTGGATCGTTAATTAAACTATAATTATCTAATGGCCTCATTCCTAAA 
CTAGTAGCATCAATATAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAACGACTAC 
CTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTAATTTGAGCTGTTACGATTAAATAATC 
TAATTTCCGCAACTCCCTCCATAGCTGCTTGAACTGCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAATTATT 
TTATTTTTAGCACTGAAACCTTGAGCTGCTAAAGCTTTAAAACAACCAATGCCATCTGTCATATGGCCTACTAAACGTCC 
GGTTCCACCTTGATTAACGATAGTATTTACAGCACCCACTAATTTAGCTTGAGGAGATAAATCATCTAGCAAAGGGATAA
CACTCTGTTTAAATGGCATTGAAACATTAACACCACGAATACCCAATGCCCTGACACCTCGAACAGCTTCTGTTAATTTA 
CCCTCTTCTACTTCAAATGTCAGATAGGCATAATTCATGTTTTTTTCTTGAAAAGAGGTATTCCACATTAACGGGGATAG

SEQ ID NO. 1907: SAG1680 FROM THE COH1 GBS TYPE Ia STRAIN

TGCACGCCACTCTCTATCCCCGTTAATGTGGAATACCTCTTTTAAGAAAAAAACATGAATTATGCCTATCTGACATTTGA 
AGTAGAAGAGGGTAAATTAACAGAAGCTGTTCGAGGTGTCAGGGCATTGAGTATTCGTGGTGTTAATGTTTCAATGCCAT 
TTAAACAGAGTGTTATCCCTTTGCTAGATGATTTATCTCCTCAAGCTAAATTAGTGGGTGCTGTAAATACT

SEQ ID NO. 1908: SAG1680 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

ATTCGTTATTAATTGAAATGCTTCTGCTCCTTGATAAATCAGCATCCCTAGACCATTATAAGCATGTTTCACTCCATTTT 
GTCTAACAAATCGTAACAATGCTGTTTCTTTAGGCTTGTAAACCAAGTCGACAACTACTAAATTGGGTGTTAAAATTTCT 
GGATCGTTAATTAAACTATAATTATCTAATGGCCTCATTCCTAAACTAGTAGCATCAATATAAAAATGACTAGTTCTAAT 
AGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAACGACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATA 
AGTCAATGACCTTATCGTAATTTGAGCTATTACGATTAAATAATCTAATTTCCGCAACTCCCTCCATAACTGCTTGAACT 
GCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAACTATTTT 

SEQ ID NO. 1909: SAG1680 FROM THE CJB110 GBS NONTYPEABLE STRAIN

ACTCTCTATCCCCGTTAATGTGGAATACCTCTTTTCAAGAAAAAAACATGAATTATGCCTATCTGACATTTGAAGTAGAA
GAGGGTAAATTAACAGAAGCTGTTCGAGGTGTCAGGGCATTGGGTATTCGTGGTGTTAATGTTTCAATGCCATTTAAACA 
GAGTGTTATCCCTTTGCTAGATGATTTATCTCCTCAAGCTAAATTAGTGGGTGCTGTAAATACTATCGTTAATCAAGGTG 
GAACCGGACGTTTAGTAGGCCATATGACAGATGGCATTGGTTGTTTTAAAGCTTTAGCAGCTCAAGGTTTCAGTGCTAAA 
AATAAAATAGTTACAATAGCTGGTATTGGTG 

SEQ ID NO. 1910: SAG1680 FROM THE 1169NT1 GBS TYPE V STRAIN 

ATTCGTTATTAATTGAAATGCTTCTGCTCCTTGATAAATCAGCATCCCTAGACCATTATAAGCATGTTTCACTCCATTTT
GTCTAACAAATCGTAACAATGCTGTTTCTTTAGGCTTGTAAACCAAGTCGACAACTACTAAATTCGGTGTTAAAATTTCT 
GGATCGTTAATTAAACTATAATTATCTAATGGCCTCATTCCTAAACTAGTAGCATCAATATAAAAATGACTAGTTCTAAT
AGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAACGACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATA 
AGTCAATGACCTTATCGTAATTTGAGCTGTTACGAT 

SEQ ID NO. 1911: SAG1680 FROM THE 1169NT1 GBS TYPE V STRAIN 

ACTTCTCTATTCCCCGTTAATGTGGAATACCTCTTTTCAAGAAAAAAACATGAATTATGCCTATCTGACATTTGAAGTAG 
AAGAGGGTAAATTAACAGAAGCTGTTCGAGGTGTCAGGGCATTGGGTATTCGTGGTGTTAATGTTTCAATGCCATTTAAA 
CAGAGTGTTATCCCTTTGCTAGATGATTTATCTCCTCAAGCTAAATTAGTGGGTGCTGTAAATACTATCGTTAATCAAGG
TGGAACC 

SEQ ID NO. 1912: SAG1680 FROM THE 18RS21 GBS TYPE II STRAIN 

TCGTTATTAATTGAAATGCTTCTGCTCCTTGATAAATCATCATCCCTAGACCATTATAAGCATGTTTCACTCCATTTTGT 
CTAACAAATCGTAACAATGCTGTTTCTTTAGGCTTGTAAACCAAGTCGACAACTACTAAATTCGGTGTTAAAATTTCTGG 
ATCGTTAATTAAACTATAATTATCTAATGGCCTCATTCCTAAACTAGTAGCATCAATATAAAAATGACTAGTTCTAATAG
CGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAACGACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAG 
TCAATGACCTTATCGTAATTTGAGCTGTTACGATTAAATAATCTAATTTCCGCAAC 

SEQ ID NO. 1913: SAG1680 FROM THE 18RS21 GBS TYPE II STRAIN 

ATGCCTATCTGACATTTGAAGTAGAAGAGGGTAAATTAACAGAAGCTGTTCGAGGTGTCAGGGCATTGGGTATTCGTGGT 
GTTAATGTTTCAATGCCATTTAAACAGAGTGTTATCCCTTTGCTAGATGATTTATCTCCTCAAGCTAAATTAGTGGGTGC
TGTAAATACTATCGTTAATCAAGGTGGAACCGGACGTTTAGTAGGCCATATGACAGATGGCATTGGTTGTTTTAAAGCTT 
TAGCAGCTCAAGGTTTCAGTGCTAAAAATAAAATAATTACAATAGCTGGTATTGGTGGTTCAGGTAAAGCAGTTGCAGTT 
CAAGCAGCTATGGAGGGAGTTGCGG 







SEQ ID NO. 1914: SAG1680 FROM THE JM9130013 GBS TYPE VIII STRAIN 

CCCTAGACCATTATAAGTCATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTCTTTAGGCTTGTAAACC 
AAGTCGACAACTACTAAATTGGGTGTTAAAATTTCTGGATCGTTAATTAAACTATAATTATCTAATGGCCTCATTCCTAA 
ACTAGTAGCATCAATATAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAACGACTA 
CCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTAATTTGAGCTATTACGATTAAATAAT
CTAATTTCCGCAACTCCCTCCATAGCTGCTTGAACTGCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAACTAT 
TTTATTTTTAGCACTGAAACCTTGAGCTGCTAAAGCTTTAAAACAACCAATGCCATCTGTCAT 

SEQ1901 ATCCCT
SEQ1902 GTTATTAATTGAAATGCTTCTGCTCCTTGATAAATCAGCATCCCT
SEQ1903 TGGTCTAATTGCCAATCCTGCACGCCACTCTCTAT-CCCCGTTAATGTGGAATACCTCT
SEQ1904 AAATCAGCATCCCT
SEQ1905
SEQ1906 CCCT
SEQ1907 TGCACGCCACTCTCTAT-CCCCGTTAATGTGGAATACCTCT
SEQ1908 ATTCGTTATTAATTGAAATGCTTCTGCTCCTTGATAAATCAGCATCCCT
SEQ1909 ACTCTCTAT-CCCCGTTAATGTGGAATACCTCT
SEQ1910 ATTCGTTATTAATTGAAATGCTTCTGCTCCTTGATAAATCAGCATCCCT
SEQ1911 ACTTCTCTATTCCCCGTTAATGTGGAATACCTCT
SEQ1912 --TCGTTATTAATTGAAATGCTTCTGCTCCTTGATAAATCATCATCCCT
SEQ1913
SEQ1914 CCCT

SEQ1901 GACCATTATAAG-CATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTC
SEQ1902 GACCATTATAAG-CATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTC
SEQ1903 TTCAAGAAAAAAACATGAATTATGCCTATCTGACATTTGAAGTAGAAGAGGGTAAATTA
SEQ1904 GAC-ATTATAAG-CATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTC
SEQ1905
SEQ1906 GACCATTATAAT-CATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTC
SEQ1907 TT-AAGAAAAAAACATGAATTATGCCTATCTGACATTTGAAGTAGAAGAGGGTAAATTA
SEQ1908 GACCATTATAAG-CATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTC
SEQ1909 TTCAAGAAAAAAACATGAATTATGCCTATCTGACATTTGAAGTAGAAGAGGGTAAATTA
SEQ1910 GACCATTATAAG-CATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTC
SEQ1911 TTCAAGAAAAAAACATGAATTATGCCTATCTGACATTTGAAGTAGAAGAGGGTAAATTA
SEQ1912 GACCATTATAAG-CATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTC
SEQ1913 ATGCCTATCTGACATTTGAAGTAGAAGAGGGTAAATTA
SEQ1914 GACCATTATAAGTCATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTC

SEQ1901 TTAGGCTTGTAAACCAAGTC--GACAACTACTAAATTCGGTGTTAAAATTTCTGGATCG
SEQ1902 TTAGGCTTGTAAACCAAGTC--GACAACTACTAAATTCGGTGTTAAAATTTCTGGATCG
SEQ1903 CAGAAGCTGTTCGAGGTGTCAGGGCATTGAGTATTCGTGGTGTTAATGTTTCAATGCCA
SEQ1904 TTAGGCTTGTAAACCAAGTC--GACAACTACTAAATTCGGTGTTAAAATTTCTGGATCG
SEQ1905 GTTCGAGGTGTCAGGGCATTGGGTATTCGTGGTGTTAATGTTTCAATGCCA
SEQ1906 TTAGGCTTGTAAACCAAGTC--GACAACTACTAAATTCGGTGTTAAAATTTCTGGATCG
SEQ1907 CAGAAGCTGTTCGAGGTGTCAGGGCATTGAGTATTCGTGGTGTTAATGTTTCAATGCCA
SEQ1908 TTAGGCTTGTAAACCAAGTC--GACAACTACTAAATTGGGTGTTAAAATTTCTGGATCG
SEQ1909 CAGAAGCTGTTCGAGGTGTCAGGGCATTGGGTATTCGTGGTGTTAATGTTTCAATGCCA
SEQ1910 TTAGGCTTGTAAACCAAGTC--GACAACTACTAAATTCGGTGTTAAAATTTCTGGATCG
SEQ1911 CAGAAGCTGTTCGAGGTGTCAGGGCATTGGGTATTCGTGGTGTTAATGTTTCAATGCCA
SEQ1912 TTAGGCTTGTAAACCAAGTC--GACAACTACTAAATTCGGTGTTAAAATTTCTGGATCG
SEQ1913 CAGAAGCTGTTCGAGGTGTCAGGGCATTGGGTATTCGTGGTGTTAATGTTTCAATGCCA
SEQ1914 TTAGGCTTGTAAACCAAGTC--GACAACTACTAAATTGGGTGTTAAAATTTCTGGATCG



SEQ1901 TT-AATTAAACTATAATTATCT-AATGGCCTCATTCCT-AAACTAGTAGCATCAAT
SEQ1902 TT-AATTAAACTATAATTATCT-AATGGCCTCATTCCT-AAACTAGTAGCATCAAT
SEQ1903 TTTAAACAGAGTGTTATCCCTTTGCTAGATGATTTATCTCCTCAAGCTAAATTAGTGGGT
SEQ1904 TT-AATTAAACTATAATTATCT-AATGGCCTCATTCCT-AAACTAGTAGCATCAAT
SEQ1905 TTTAAACAGAGTGTTATCCCTTTGCTARATGATTTATCTCCTCAAGCTAAATTAGTGGGT
SEQ1906 TT-AATTAAACTATAATTATCT-AATGGCCTCATTCCT-AAACTAGTAGCATCAAT
SEQ1907 TTTAAACAGAGTGTTATCCCTTTGCTAGATGATTTATCTCCTCAAGCTAAATTAGTGGGT
SEQ1908 TT-AATTAAACTATAATTATCT-AATGGCCTCATTCCT-AAACTAGTAGCATCAAT
SEQ1909 TTTAAACAGAGTGTTATCCCTTTGCTAGATGATTTATCTCCTCAAGCTAAATTAGTGGGT
SEQ1910 TT-AATTAAACTATAATTATCT-AATGGCCTCATTCCT-AAACTAGTAGCATCAAT
SEQ1911 TTTAAACAGAGTGTTATCCCTTTGCTAGATGATTTATCTCCTCAAGCTAAATTAGTGGGT
SEQ1912 TT-AATTAAACTATAATTATCT-AATGGCCTCATTCCT-AAACTAGTAGCATCAAT
SEQ1913 TTTAAACAGAGTGTTATCCCTTTGCTAGATGATTTATCTCCTCAAGCTAAATTAGTGGGT
SEQ1914 TT-AATTAAACTATAATTATCT-AATGGCCTCATTCCT-AAACTAGTAGCATCAAT

SEQ1901 TAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAAC
SEQ1902 TAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAAC
SEQ1903 CTGTAAATACTATCGTTAATCAAGGTGGAACCGGACGTTTAGTAGGCCATATGACAGAT
SEQ1904 TAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAAC
SEQ1905 CTGTAAATACTATCGTTAATCAAGGTGGAACCGSACGTTTAGTAGGCCATATGACAGAT
SEQ1906 TAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAAC
SEQ1907 CTGTAAATACT
SEQ1908 TAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAAC
SEQ1909 CTGTAAATACTATCGTTAATCAAGGTGGAACCGGACGTTTAGTAGGCCATATGACAGAT
SEQ1910 TAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAAC
SEQ1911 CTGTAAATACTATCGTTAATCAAGGTGGAACC
SEQ1912 TAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAAC
SEQ1913 CTGTAAATACTATCGTTAATCAAGGTGGAACCGGACGTTTAGTAGGCCATATGACAGAT
SEQ1914 TAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAAC



SEQ1901 ACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTA
SEQ1902 ACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTA
SEQ1903 GCATTGGTTGTTTTAAAGCTTTAGCAGCTCAAGGTTTCAGTGCTAAAAATAAAATAATT
SEQ1904 ACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTA
SEQ1905 GCATTGGTTGTTTTAAAGCTTTAGCAGCTCAAGGTTTCAGTGCTAAAAATAAAATAGTT
SEQ1906 ACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTA
SEQ1907
SEQ1908 ACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTA
SEQ1909 GCATTGGTTGTTTTAAAGCTTTAGCAGCTCAAGGTTTCAGTGCTAAAAATAAAATAGTT
SEQ1910 ACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTA
SEQ1911
SEQ1912 ACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTA
SEQ1913 GCATTGGTTGTTTTAAAGCTTTAGCAGCTCAAGGTTTCAGTGCTAAAAATAAAATAATT
SEQ1914 ACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTA



SEQ1901 TTTGAGCTGTTACGATTAAATAATCTAATTTCCGCAACTCCCTCCATAGCTGCTTGAAC
SEQ1902 TTTGAGCTGTTACGATTAAATAATCTAATTTCCGCAACTCCCTCCATAGCTGCTTGAAC
SEQ1903 CAATAGCTGGTATTGGTGGTTCAGGTAAAGCAGTTGCAGTTCAAGCAGCTATGGAGGGA
SEQ1904 TTTGAGCTGTTACGATTAAATAATCTAATTTCCGCAACTCCCTCCATAGCTGCTTGAAC
SEQ1905 CAATAGCTGGTATTGGTGGTTCAGGTAAAGCAGTTGCAGTTCAAGCAGCTATGGAGGGA
SEQ1906 TTTGAGCTGTTACGATTAAATAATCTAATTTCCGCAACTCCCTCCATAGCTGCTTGAAC
SEQ1907
SEQ1908 TTTGAGCTATTACGATTAAATAATCTAATTTCCGCAACTCCCTCCATAACTGCTTGAAC
SEQ1909 CAATAGCTGGTATTGGTG
SEQ1910 TTTGAGCTGTTACGAT
SEQ1911
SEQ1912 TTTGAGCTGTTACGATTAAATAATCTAATTTCCGCAAC
SEQ1913 CAATAGCTGGTATTGGTGGTTCAGGTAAAGCAGTTGCAGTTCAAGCAGCTATGGAGGGA
SEQ1914 TTTGAGCTATTACGATTAAATAATCTAATTTCCGCAACTCCCTCCATAGCTGCTTGAAC



SEQ1901 GCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAATTATTTTATTTTTAGCACT
SEQ1902 GCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAATTATTTTATTTTTAGCACT
SEQ1903 TTGCGGAAATTAGATTATTTAATCGTAACAGCTCAAATTACGATAAGGTCATTGACTTA
SEQ1904 GCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAATTATTTTATTTTTAGCACT
SEQ1905 TTGCGGAAATTAGATTATTTAATCGTAATAGCTCAAATTACGATAAGGTCATTGACTTA
SEQ1906 GCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAATTATTTTATTTTTAGCACT
SEQ1907
SEQ1908 GCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAACTATTTT
SEQ1909
SEQ1910
SEQ1911
SEQ1912
SEQ1913 TTGCGG
SEQ1914 GCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAACTATTTTATTTTTAGCACT



SEQ1901 AAACCTTGAGCTGCTAAAGCTTTAAAACAACCAATGCCATCTGTCATATGGCCTACTAA
SEQ1902 AAACCTTGAGCTGCTAAAGCTTTAAAACAACCAATGCCATCTGTCATATGGCCTACTAA
SEQ1903 CAGATAAAATTAAAAAACAGTTTCAAATAAAGGTAGTCGTTGATTATCTAGAAAATAAG
SEQ1904 AAACCTTGAGCTGCTAAAGCTTTAAAACAACCAATGCCATCTGTCATATGGCCTACTAA
SEQ1905 CAGATAAAATTAAAAAACAGTTTCAAATAAAGGTAGTCGTTGATTATCTAGAAAATAAG
SEQ1906 AAACCTTGAGCTGCTAAAGCTTTAAAACAACCAATGCCATCTGTCATATGGCCTACTAA
SEQ1907
SEQ1908
SEQ1909
SEQ1910
SEQ1911
SEQ1912
SEQ1913
SEQ1914 AAACCTTGAGCTGCTAAAGCTTTAAAACAACCAATGCCATCTGTCAT



SEQ1901 CGTCCGGTTCCACCTTGATTAACGATAGTATTTACAGCACCCACTAATTTAGCTTGAGG
SEQ1902 CGTCCGGTTCCACCTTGATTAACGATAGTATTTACAGCACCCACTAATTTAGCTTGAGG
SEQ1903 CAGCATTTAAAGACGCTATTAGAACTAGTCATTTTTATATTGATGCTACTAGTTTAGGA
SEQ1904 CGTCCGGTTCCACCTTGATTAACGATAGTATTTACAGCACCCACTAATTTAGCTTGAGG
SEQ1905 CAGCATTTAAAGACGCTATTAGAACTAGTCATTTTTATATTGATGCTACTAGTTTAGGA
SEQ1906 CGTCCGGTTCCACCTTGATTAACGATAGTATTTACAGCACCCACTAATTTAGCTTGAGG
SEQ1907
SEQ1908
SEQ1909
SEQ1910
SEQ1911
SEQ1912
SEQ1913
SEQ1914



SEQ1901 GATAAATCATCTAGCAAAGGGATAACACTCTGTTTAAATGGCATTGAAACATTAACACC
SEQ1902 GATAAATCATCTAGCAAAGGGATAACACTCTGTTTAAATGGCATTGAAACATTAACACC
SEQ1903 TGAGGCCATTAGATAATTATAGTTTAATTAACGATCCAGATATTTTAACACCGAATTTA
SEQ1904 GATAAATCATCTAGCAAAGGGATAACACTCTGTTTAAATGGCATTGAAACATTAACACC
SEQ1905 TGARGCCATTAGATAATTATAGTTTAATTAACGATCCAGAAATTTTAACACCCAATTTA
SEQ1906 GATAAATCATCTAGCAAAGGGATAACACTCTGTTTAAATGGCATTGAAACATTAACACC
SEQ1907
SEQ1908
SEQ1909
SEQ1910
SEQ1911
SEQ1912
SEQ1913
SEQ1914



SEQ1901 
SEQ1902 
SEQ1903 
SEQ1904 
SEQ1905 
SEQ1906 
SEQ1907 
SEQ1908 
SEQ1909 
SEQ1910 
SEQ1911 
SEQ1912 
SEQ1913 
SEQ1914 

SEQ1901 
SEQ1902 
SEQ1903 
SEQ1904 
SEQ1905 
SEQ1906 
SEQ1907 
SEQ1908 
SEQ1909 
SEQ1910 
SEQ1911 
SEQ1912 
SEQ1913 
SEQ1914 

SEQ1901 
SEQ1902 
SEQ1903 
SEQ1904 
SEQ1905 
SEQ1906 
SEQ1907 
SEQ1908 
SEQ1909 
SEQ1910 
SEQ1911 
SEQ1912 
SEQ1913 
SEQ1914 



CGAATACCCAATGCCCTGACACCTCGAACAGCTTCTGTTAATTTACCCTCTTCTACTTC 
CGAATACCCAATGCCCTGACACCTCGAACAGCTTCTGTTAATTTACCCTCTTCTACTTC 
TAGTTGTCGACTT 

CGAATACTCAATGCCCTGACACCTCGAACAGCTTCTGTTAATTTACCCTCTTCTACTTC 
TAGTTGTCGACTTGGTTTACAAGCCTAAAGAAACAGCATTGTTACGATTTGTTAGACAA 
CGAATACCCAATGCCCTGACACCTCGAACAGCTTCTGTTAATTTACCCTCTTCTACTTC 

AATGTCAGATAGGCATAATTCATGTTTTTTTCTTGAAAA 

AATGTCAGATAGGCATAATTCATGTTTTTTTCTTGAAAAGAGGTATTCCACATTAACGG 

ATGGAGTGAAACATGCTTATAATGGTCTAGGGATGCTGATTTATCAAGGAGCAGA 

AATGTCAGATAGGCATAATTCATGTTTTTTTCTTGAAAAGAGGTATTCCACATTAACGG 

GATAGAGAGTGGCGTGCAGG-
GATAGAGAGTGGCGTGCAGGA

GATAGAGAGTGGCGTGCA

GATAG
Table 20: Comparative Sequences relating to SAG1723 (signal peptidase I)



SEQ ID NO. 2001: SAG1723 FROM THE COH1 GBS TYPE Ia STRAIN

ATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCGGCCAAAAGAAAAAAATTGTT^ 

ATCAAATATAAAAATGACACCTTAACTATTAAGAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATACTAAATTATTTAAAAAGGA

TAAATTACAGGAAAAATATTCGTATAACCCACTTTTCCAAG^

GCGAATTTACTACTGTCGTGCCTAAAGGCCACTACTATCTTGTTGX5T 

TTCAAAA 

SEQ ID NO. 2002: SAG1680 FROM THE CJB110 GBS NONTYPEABLE STRAIN (REVERSE COMPLEMENT)

TAAAGTTGACGGACACTCCATGGATCCAACTTT^ 

TTGTAGTGGCTAACGAAGAAGAAGGCGGCCAAAAGAAAAAAATTGTTAAACGTGTCAT^ 

AATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATACTAAATTATT 

AAAATATTCGTATAACCCACTTTTCCAAGACCTAGCACAAAGCT 

CTGTCGTGCCTAAAGGCCACTATTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGATAGTCGTGCCGTCGGTCCCTTCAAAAAATCA 
ACAATTGTGGGAG 

SEQ ID NO. 2003: SAG1680 FROM THE 18RS21 GBS TYPE II STRAIN

TTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGA 

GTGGCTAACGAAGAAGAAGGCGGCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATAAAAATGA
CACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATACTAAATTATTTAAAAAGGATAAATTACAGGAAAAAT
ATTCGTATAACCCACTTTTCCAAGACCTAGCACAAAGCTCTACCGCTTTCACC^ 

GTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGATAGTCGTGCCGTCGGTCCCTTCAAAAAATCAACGAT 
TGTGGGAGAGGT 

SEQ ID NO. 2004: SAG1680 FROM THE 2603V/R GBS TYPE V STRAIN (REVERSE COMPLEMENT)

AAGTTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAG 
GTAGTGGCTAACGAAGAAGAAGGCGGCCAAAAGAAAAAAATTGTT 

TGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATACTAAATTATTTAAA
AATATTCGTATAACCCACTTTTCCAAGACCTAGCACAAAGCTCTACCGCTTTCACCA 

GTCGTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGATAGTCGTGCCGTCGGT 

SEQ ID NO. 2005: SAG1680 FROM THE M732 GBS TYPE III STRAIN (REVERSE COMPLEMENT)

TTGACGGACACTCCATGGATCCAACTTTAGCTGAC^ 
G^CTAACGAAGAAGAAGGCGGCCAAAAGAAAAAAATTGTTAAACGTC 

CCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATACTAAATTATTTAAAAAGGATAAATT^ 

TCGTATAACCCACTTTTCCAAGACCTAGCACAAAGC^

GCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGA 

SEQ ID NO. 2006: SAG1680 FROM THE M781 GBS TYPE III STRAIN 

TTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGi^ 

GTGGCTAACGAAGAAGAAGGCGGCCAAAAGAAAAAAATTGTTAAACGTGTCATTGG 
CACCTTAACTATTAACAATAAAAAAACAGAAGAACCT

TATTCGTATAACCCACTTTTCCAAGACCTAGCACAAAGCTCTACCGCTTTCA 

SEQ ID NO. 2007: SAG1680 FROM THE 1169NT1 GBS TYPE V STRAIN (REVERSE COMPLEMENT)

TTGGTAAAGTTGACGGACACTCCATGGATCCAACTTTAGCTG^ 
GATATTGTAGTGGCTAACGAAGAAGAAGGCGGCCAAAAGAAAAAAATTGTTAAAC^ 
TAAAAATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATACTAAA 
AGGAAAAATATTCGTATAACCCACTTTTCCAAGACCTAGCACAAAGCTCT^ 

ACCACTGTCGTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGATAGTCGTGCCGTCGGCCCCTTCAAAAA 
ATCAACG 

SEQ ID NO. 2008: SAG1680 FROM THE H36b GBS TYPE Ib STRAIN (REVERSE COMPLEMENT)

TTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAGTTCTC

GTGGCTAACGAAGAAGAAGGCGGCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATAAAAATGA
CACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATACTAAATTATTTAAAAA
ATTCGTATAACCCACTTTTCCAAGACCTAGCACAAAGCTCTACCGCTTTCACCACTGACAGCAATGGCAGCAG 
GTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGA 

SEQ ID NO. 2009: SAG1680 FROM THE 090 GBS TYPE Ia STRAIN (REVERSE COMPLEMENT)

TAAAGTTGACGGACACTCCATGGATCCAACTTTAGCTGA

TTGTAGTGGCTAACGAAGAAGAAGGCGGCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCA

AATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTT^

AAAATATTCGTATAACCCACTTTTCCAAGACCTAGCACAAAGCTCTA 

CTGTCGTGCCTAAAGGCCACTATTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGATAGTCGTGCCGTCGGT 



SEQ ID NO. 2010: SAG1680 FROM THE A909 GBS TYPE Ia STRAIN (REVERSE COMPLEMENT)

AAAGTTGACGGACACTCCATGGA

TGTAGTGGCTAACGAAGAAGAAGGCGGCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGC

ATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATACTAA^

AAATATTCGTATAACCGACTTTTCCAAGACCTAGCACAAAGCTCTACCGCTTTC^C^

TGTCGTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGATA

CG 



SEQ2001
SEQ2002 TAAAGTTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAG
SEQ2003 TTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAG
SEQ2004 AAGTTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAG
SEQ2005 TTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAG
SEQ2006 -TTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAG
SEQ2007 TGGTAAAGTTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAG
SEQ2008 TTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAG
SEQ2009 TAAAGTTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAG
SEQ2010 AAAGTTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAG

SEQ2001 ATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCG
SEQ2002 TCTCAAACAAACAAAAATCAATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCG
SEQ2003 TCTCAAACAAACAAAAATCAATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCG
SEQ2004 TCTCAAACAAACAAAAATCAATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCG
SEQ2005 TCTCAAACAAACAAAA--TAATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCG
SEQ2006 TCTCAAACAAACAAAAATCAATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCG
SEQ2007 TCTCAAACAAACAAAAATCAATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCG
SEQ2008 TCTCAAACAAACAAAAATCAATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCG
SEQ2009 TCTCAAACAAACAAAAATCAATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCG
SEQ2010 TCTCAAACAAACAAAAATCAATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCG

SEQ2001 GCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATA
SEQ2002 GCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATA
SEQ2003 GCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATA
SEQ2004 GCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATA
SEQ2005 GCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATA
SEQ2006 GCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATA
SEQ2007 GCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATA
SEQ2008 GCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATA
SEQ2009 GCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATA
SEQ2010 GCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATA

SEQ2001 AAAATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATA
SEQ2002 AAAATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATA
SEQ2003 AAAATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATA
SEQ2004 AAAATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATA
SEQ2005 AAAATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATA
SEQ2006 AAAATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATA
SEQ2007 AAAATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATA
SEQ2008 AAAATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATA
SEQ2009 AAAATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATA
SEQ2010 AAAATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATA

SEQ2001 CTAAATTATTT-AAAAAGGATAAATTACAGGAAAAATATTCGTATAACCCACTTTTCCAA
SEQ2002 CTAAATTATTT-AAAAAGGATAAATTACAGGAAAAATATTCGTATAACCCACTTTTCCAA
SEQ2003 CTAAATTATTT-AAAAAGGATAAATTACAGGAAAAATATTCGTATAACCCACTTTTCCAA
SEQ2004 CTAAATTATTT-AAAAAGGATAAATTACAGGAAAAATATTCGTATAACCCACTTTTCCAA
SEQ2005 CTAAATTATTT-AAAAAGGATAAATTACAGGAAAAATATTCGTATAACCCACTTTTCCAA
SEQ2006 CTAAATTATTTTAAAAAGGATAAATTACAGGAAAAATATTCGTATAACCCACTTTTCCAA
SEQ2007 CTAAATTATTT-AAAAAGGATAAATTACAGGAAAAATATTCGTATAACCCACTTTTCCAA
SEQ2008 CTAAATTATTT-AAAAAGGATAAATTACAGGAAAAATATTCGTATAACCCACTTTTCCAA
SEQ2009 CTAAATTATTT-AAAAAGGATAAATTACAGGAAAAATATTCGTATAACCCACTTTTCCAA
SEQ2010 CTAAATTATTT-AAAAAGGATAAATTACAGGAAAAATATTCGTATAACCCACTTTTCCAA



SEQ2001 GACCTAGCACAAAGCTCTACCGCTTTCACCACTGACAGCAATGGCAGCAGCGAATTTACT
SEQ2002 GACCTAGCACAAAGCTCTACCGCTTTCACTACTGACAGCAATGGCAGCAGCGAATTTACT
SEQ2003 GACCTAGCACAAAGCTCTACCGCTTTCACCACTGACAGCAATGGCAGCAGCGAATTTACT
SEQ2004 GACCTAGCACAAAGCTCTACCGCTTTCACCACTGACAGCAATGGCAGCAGCGAATTTACT
SEQ2005 GACCTAGCACAAAGCTCTACCGCTTTCACCACTGACAGCAATGGCAGCAGCGAATTTACT
SEQ2006 GACCTAGCACAAAGCTCTACCGCTTTCACCACTGACAGCAATGGCAGCAGCGAATTTACT
SEQ2007 GACCTAGCACAAAGCTCTACCGCTTTCACTACTGACAGCAATGGCAGCAGCGAATTTACC
SEQ2008 GACCTAGCACAAAGCTCTACCGCTTTCACCACTGACAGCAATGGCAGCAGCGAATTTACT
SEQ2009 GACCTAGCACAAAGCTCTACCGCTTTCACTACTGACAGCAATGGCAGCAGCGAATTTACT
SEQ2010 GACCTAGCACAAAGCTCTACCGCTTTCACCACTGACAGCAATGGCAGCAGCGAATTTACT

SEQ2001 CTGTCGTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGAT
SEQ2002 CTGTCGTGCCTAAAGGCCACTATTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGAT
SEQ2003 CTGTCGTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGAT
SEQ2004 CTGTCGTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGAT
SEQ2005 CTGTCGTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGA
SEQ2006
SEQ2007 CTGTCGTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGAT
SEQ2008 CTGTCGTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGA
SEQ2009 CTGTCGTGCCTAAAGGCCACTATTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGAT
SEQ2010 CTGTCGTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGAT

SEQ2001 GTCGTGCCGTCGGTTCCTTCAAAA
SEQ2002 GTCGTGCCGTCGGTCCCTTCAAAAAATCAACAATTGTGGGAG
SEQ2003 GTCGTGCCGTCGGTCCCTTCAAAAAATCAACGATTGTGGGAGAGGT
SEQ2004 GTCGTGCCGTCGGT
SEQ2005
SEQ2006
SEQ2007 GTCGTGCCGTCGGCCCCTTCAAAAAATCAACG
SEQ2008
SEQ2009 GTCGTGCCGTCGGT
SEQ2010 GTCGTGCCGTCGGTCCCTTCAAAAAATCAACG
Table 21: Comparative Sequences relating to SAG0079 (adenylate kinase)

SEQ ID NO. 2101: SAG0079 FROM THE 2603V/R GBS TYPE V STRAIN

AATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTTGAAGA

ACAGGGGATATGTTCCGCGCCGCAATGGCTAATCAAACCGAAATGGGACGTTTAGCTAAAAGTTATATT 

CCTGATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGTTTTTTAC 

CGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGAT 
CCATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTG

GATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAACGTCGCTTGGACGTTAATATTG 

GAACCTATTCTTGAACACTATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATC^ 

GAAAAAGCGTTG 

SEQ ID NO. 2102: SAG0079 FROM THE 090 GBS TYPE Ia STRAIN (REVERSE COMPLEMENT)

AATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTT^ 
ACAGGGGATATGTTCCGCGCCGCAATGGCTA^ 

CCTGATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGC^ 

CGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGAT
CCATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATC^ 

GATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCA^ 

GAACCTATTCTTGAACACTATCGTAAGCTTGGTCTTGTTACAGATATTGAAG

GAAAAAGCGTTGCTAGAACTCAAA 

SEQ ID NO. 2103: SAG0079 FROM THE 1169NT1 GBS TYPE V STRAIN (REVERSE COMPLEMENT) 

TGGTAAAGGGACTCAAGCAGCTAAGATTGTTG 

TAATCAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCC 

AGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGTTTTTTACTTGATGGGTATCCACGTACTATTGAACAAGCA 

TGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTTATAGAGCGTTTGAGTGG 

TCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTGTTCAACCCACCAGTAGATTATAAAGA 

TGAAGATGATAAGCCTGAAACTGTGAAACGTCGCTTGGAC^ 

TGGCCTTGTTACAGATATTGAAGGTAATCAAGAAATAA 



SEQ ID NO. 2104: SAG0079 FROM THE 18RS21 GBS TYPE II STRAIN (REVERSE COMPLEMENT) 

AATCTTTTAACCACGGGTTCGCCTGGTGCTGGTAAAGGTACTCA 

ACAGGGGATATGTTCCGCGCCGCAATGGCTAATCAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTT 

CCTGATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAA 

CGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATT

CCATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCC^ 

GATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAACGTC^

GAACCTATTCTTGAACACTATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCAAGAAATAACA 

GAAAAAGCGTTGCTAGAA 

SEQ ID NO. 2105: SAG0079 FROM THE 2603V/R GBS TYPE V STRAIN (REVERSE COMPLEMENT) 

AATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTTGAAGAATTTGGTGTTGCTCACATCTCA 
ACAGGGGATATGTTCCGCGCCGCAATGGCTAATCAAA 

CCTGATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGTTTTTTACTTGATGG 

CGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGAT

CCATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACT^

GATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAACGTC^

GAACCTATTCTTGAACACTATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCAAGA

GAAAAAGCGTTG 

SEQ ID NO. 2106: SAG0079 FROM THE A909 GBS TYPE Ia STRAIN (REVERSE COMPLEMENT)

AATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTTGAAGAATTTGGTGTTGCTCACATCTCA
ACAGGGGATATGTTCCGCGCCGCAATGGCTAATCAAACCGAAATGG

CCTGATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGTTTTTTACTTGATGGATATCCA

CGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGAT

CCATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCA^ 

GATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAAro 

GAATCTATTCTTGAACACTATCGAAAGCTTGGTCTTGTTACAGATATTGAAGGTAA 

SEQ ID NO. 2107: SAG0079 FROM THE CJB110 GBS NONTYPEABLE STRAIN (REVERSE COMPLEMENT) 

AATCTTTTAACCACGGGTTTGCTTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTTGAAGAATTTGGTGTTGCTCACATCTCA 

ACAGGGGATATGTTCCGCGCCGCAATGGCTAATCAAACCGAAATGGGACGT^ 

CCTGATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAG

CGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGAT
CCATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAA
GATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAACGTCGCT^ 
GAACCTATTCTTGAACACTATAG 

SEQ ID NO. 2108: SAG0079 FROM THE COH1 GBS TYPE III STRAIN (REVERSE COMPLEMENT)

ATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATTGTTGAAGAATTTGGTGTTGCTCA
CAGGGGATATGTTCCGCGCCGCAATGGCTAATCAAACCCAAATGGGACGTTT^^

CTGATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGTTTTTTACTTGATGGATA 

GTACTATTGAGCAAGCACACGCCTTAGATGCTACGCTTGAA 

CAACATGCCTTATAGAGCGTTTGAGTGGCCGTATTATCAATCGTAAA 

ATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAACGTCGC^ 

AACCTATTCTTGAACACTATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCAAGAAATAACAGAAGTTTTTGC^
AAAAAGCGTTGCTAG 

SEQ ID NO. 2109: SAG0079 FROM THE H36b GBS TYPE Ib STRAIN (REVERSE COMPLEMENT)

CAGGGGATATGTTCCGCGCCGCAATGGCTAATCAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTC
CTGATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGTTTTTTACTTGATGGATATCCAC 
GTACTATTGAACAAGCACACGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATC 
CATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATCAATC 

ATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAACGTCGCTTGGACGTTAA 

AATCTATTCTTGAACACTATCGTAAGCTTGGTCTTGTTACAG^ 

AAAAAGCGTTGCT 

SEQ ID NO. 2110: SAG0079 FROM THE JM9130013 GBS TYPE VIII STRAIN (REVERSE COMPLEMENT)

AATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTTGAAGAATTTGGTGTTGCTCACATCTCA 

ACAGGGGATATGTTCCGCGCCGCAATGGCTAATCAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTT 

CCTGATGAAGTAAGAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGTTTTTTACTTGA 

CGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGAT 

CCATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATCAAT^ 

GATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGA 

GAACCTATTCTTGAACACTATAAAAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCA 

SEQ ID NO. 2111: SAG0079 FROM THE M732 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

CTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACT 

GGGGATATGTTCCGCGCCGCAATGGCTAATCAAACCCAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCT
GATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGA 

ACTATTGAGCAAGCACACGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAA 
ACATGCCTTATAGAGCGTTTGAGTGGCCGTATTATCAATCGTAAAACTGGTGAAACT^ 

TATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAACGTCGCTTGGACGTTAAT^ 

CCTATTCTTGAACACTATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGT^ 

AAAGCGTTGCTAGAACTCAAA 

SEQ ID NO. 2112: SAG0079 FROM THE M781 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

AATCTTTTAATTACGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATTGTTGAAGAATTTGGTG
ACAGGGGATATGTTCCGCGCCGCAATGGCTAATCAAACCCAAATGG

CCTGATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGTTTTTTACTTGATGGATATCCA
CGTACTATTGAGCAAGCACACGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGAT 
CCAACATGCCTTATAGAGCGTTTGAGTGGCCGTATTATCAATCGTAAA^ 

GATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAA 



SEQ2101 ATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTT
SEQ2102 ATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTT
SEQ2103 TGGTAAAGGGACTCAAGCAGCTAAGATTGTT
SEQ2104 ATCTTTTAACCACGGGTTCGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTT
SEQ2105 ATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTT
SEQ2106 ATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTT
SEQ2107 ATCTTTTAACCACGGGTTTGCTTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTT
SEQ2108 ATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATTGTT
SEQ2109
SEQ2110 ATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTT
SEQ2111 --CTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATTGTT
SEQ2112 ATCTTTTAATTACGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATTGTT

SEQ2101 AAGAATTTGGTGTTGCTCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT
SEQ2102 AAGAATTTGGTGTTGCTCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT
SEQ2103 AAGAATTTGGTGTTGCGCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT
SEQ2104 AAGAATTTGGTGTTGCTCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT
SEQ2105 AAGAATTTGGTGTTGCTCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT
SEQ2106 AAGAATTTGGTGTTGCTCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT
SEQ2107 AAGAATTTGGTGTTGCTCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT
SEQ2108 AAGAATTTGGTGTTGCTCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT



SEQ2109 CAGGGGATATGTTCCGCGCCGCAATGGCTAAT
SEQ2110 AAGAATTTGGTGTTGCTCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT
SEQ2111 AAGAATTTGGTGTTGCTCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT
SEQ2112 AAGAATTTGGTGTTGCTCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT

SEQ2101 CAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT
SEQ2102 CAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT
SEQ2103 CAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT
SEQ2104 CAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT
SEQ2105 CAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT
SEQ2106 CAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT
SEQ2107 CAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT
SEQ2108 CAAACCCAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT
SEQ2109 CAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT
SEQ2110 CAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT
SEQ2111 CAAACCCAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT
SEQ2112 CAAACCCAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT

SEQ2101 AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT
SEQ2102 AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT
SEQ2103 AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT
SEQ2104 AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT
SEQ2105 AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT
SEQ2106 AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT
SEQ2107 AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT
SEQ2108 AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT
SEQ2109 AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT
SEQ2110 AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT
SEQ2111 AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT
SEQ2112 AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT

SEQ2101 TTTTTACTTGATGGATATCCACGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTT
SEQ2102 TTTTTACTTGATGGATATCCACGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTT
SEQ2103 TTTTTACTTGATGGGTATCCACGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTT
SEQ2104 TTTTTACTTGATGGATATCCACGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTT
SEQ2105 TTTTTACTTGATGGATATCCACGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTT
SEQ2106 TTTTTACTTGATGGATATCCACGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTT
SEQ2107 TTTTTACTTGATGGATATCCACGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTT
SEQ2108 TTTTTACTTGATGGATATCCACGTACTATTGAGCAAGCACACGCCTTAGATGCTACGCTT
SEQ2109 TTTTTACTTGATGGATATCCACGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTT
SEQ2110 TTTTTACTTGATGGATATCCACGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTT
SEQ2111 TTTTTACTTGATGGATATCCACGTACTATTGAGCAAGCACACGCCTTAGATGCTACGCTT
SEQ2112 TTTTTACTTGATGGATATCCACGTACTATTGAGCAAGCACACGCCTTAGATGCTACGCTT

SEQ2101 GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTT
SEQ2102 GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTT
SEQ2103 GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTT
SEQ2104 GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTT
SEQ2105 GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTT
SEQ2106 GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTT
SEQ2107 GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTT
SEQ2108 GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCAACATGCCTT
SEQ2109 GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTT
SEQ2110 GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTT
SEQ2111 GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCAACATGCCTT
SEQ2112 GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCAACATGCCTT

SEQ2101 ATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG
SEQ2102 ATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG
SEQ2103 ATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG
SEQ2104 ATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG
SEQ2105 ATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG
SEQ2106 ATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG
SEQ2107 ATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG
SEQ2108 ATAGAGCGTTTGAGTGGCCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG
SEQ2109 ATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG
SEQ2110 ATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG
SEQ2111 ATAGAGCGTTTGAGTGGCCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG
SEQ2112 ATAGAGCGTTTGAGTGGCCGTATTATCAATCGTAAAACTGGTGAAAGTTTCCACAAAGTG
SEQ2101
SEQ2102
SEQ2103
SEQ2104
SEQ2105
SEQ2106
SEQ2107
SEQ2108
SEQ2109
SEQ2110
SEQ2111
SEQ2112

SEQ2101 
SEQ2102 
SEQ2103 
SEQ2104 
SEQ2105 
SEQ2106 
SEQ2107 
SEQ2108
SEQ2109
SEQ2110 
SEQ2111 
SEQ2112 

SEQ2101 
SEQ2102 
SEQ2103 
SEQ2104 
SEQ2105 
SEQ2106 
SEQ2107 
SEQ2108 
SEQ2109
SEQ2110 
SEQ2111 
SEQ2112 

SEQ2101 
SEQ2102 
SEQ2103 
SEQ2104 
SEQ2105 
SEQ2106 
SEQ2107 
SEQ2108 
SEQ2109 
SEQ2110 
SEQ2111 
SEQ2112 



TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 

GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAACCTATTCTTGAACAC 
GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAACCTATTCTTGAACAC 
GAAAGTGTCAAACGTCGCTTGGACGTTCATATTGCTCAAGGAGAACCTATTCTTGAACAC 
GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAACCTATTCTTGAACAC 
GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAACCTATTCTTGAACAC 
GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAATCTATTCTTGAACAC 
GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAACCTATTCTTGAACAC 
GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAACCTATTCTTGAACAC 
GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAATCTATTCTTGAACAC 
GAAACTGTTAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAACCTATTCTTGAACAC 
GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAACCTATTCTTGAACAC
GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAA

ATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCAAGAAATAACAGAAGTTTTT 
ATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCAAGAAATAACAGAAGTTTTT 

ATAGTAAGCTTGGCCTTGTTACAGATATTGAAGGTAATCAAGAAATAA 

ATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCAAGAAATAACAGAAGTTTTT 
ATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCAAGAAATAACAGAAGTTTTT 

ATCGAAAGCTTGGTCTTGTTACAGATATTGAAGGTAA 

ATAG 

ATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCAAGAAATAACAGAAGTTTTT 
ATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCAAGAAATAACAGAAGTTTTT

ATAAAAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCA

ATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCAAGAAATAACAGAAGTTTTT 

CAGATGTTGAAAAAGCGTTG 

CAGATGTTGAAAAAGCGTTGCTAGAACTCAAA 



CAGATGTTGAAAAAGCGTTGCTAGAA- 
CAGATGTTGAAAAAGCGTTG 



CAGATGTTGAAAAAGCGTTGCTAG 
CAGATGTTGAAAAAGCGTTGCT- - • 



CAGATGTTGAAAAAGCGTTGCTAGAACTCAAA 
>SEQ ID NO 2150:090 frame: 1

NLLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD
EVTNGIVKERIAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL 
IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH
YRKLGLVTDIEGNQEITEVFADVEKALLELK

>SEQ ID NO 2151:114_1169NT frame: 2 

GKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPDQVTNGIVKER 
LAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCLIERLSGRIIN
RKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVHIAQGEPILEHYSKLGLVTDI
EGNQEI 

>SEQ ID NO 2152: 114_18RS21 frame: 1

NLLTTGSPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD
EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL
IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH
YRKLGLVTDIEGNQEITEVFADVEKALLE

>SEQ ID NO 2153: 114_2603 frame: 1 

NLLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD 
EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL
IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH
YRKLGLVTDIEGNQEITEVFADVEKAL

>SEQ ID NO 2154: 114_A909 frame: 1 

NLLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD
EVTNGIVKERIAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL
IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGESILEH
YRKLGLVTDIEG

>SEQ ID NO 2155:114_A909 frame: 1 

NLLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD
EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL
IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGESILEH
YRKLGLVTDIEG

>SEQ ID NO 2156: 114_CJB110 frame: 1 

NLLTTGLLGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD 
EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL 
IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH
Y 

>SEQ ID NO 2157: 114_COH1 frame: 3

LLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTQMGRLAKSYIDKGELVPDE 
VTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPTCLI 
ERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEHY
RKLGLVTDIEGNQEITEVFADVEKALL

>SEQ ID NO 2158: 114_H36B frame: 3

GDMFRAAMANQTEMGRLAKSYIDKGELVPDEVTNGIVKERLAEDDIAEKGFLLDGYPRTI 
EQAHALDATLEELGLRLDGVINIKVDPSCLIERLSGRIINRKTGETFHKVFNPPVDYKEE 
DYYQREDDKPETVKRRLDVNIAQGESILEHYRKLGLVTDIEGNQEITEVFADVEKAL 

>SEQ ID NO 2159: 114_JM9130013 frame: 1 

NLLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD 
EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL 
IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH
YKKLGLVTDIEGN 

>SEQ ID NO 2160:114_M732 frame: 1 

LLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTQMGRLAKSYIDKGELVPDE
VTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPTCLI
ERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEHY
RKLGLVTDIEGNQEITEVFADVEKALLELK 

>SEQ ID NO 2161: 114_M781 frame: 1

NLLITGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTQMGRLAKSYIDKGELVPD
EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPTCL
IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQ






WO 2004/018646 



PCT/US2003/026827 
SEQ2150 LLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD
SEQ2151          GKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD
SEQ2152 LLTTGSPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD
SEQ2153 LLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD
SEQ2154 LLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD
SEQ2155 LLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD
SEQ2156 LLTTGLLGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD
SEQ2157 LLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTQMGRLAKSYIDKGELVPD
SEQ2158                              GDMFRAAMANQTEMGRLAKSYIDKGELVPD
SEQ2159 LLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD
SEQ2160 LLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTQMGRLAKSYIDKGELVPD
SEQ2161 LLITGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTQMGRLAKSYIDKGELVPD

SEQ2150 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL
SEQ2151 QVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL
SEQ2152 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL
SEQ2153 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL
SEQ2154 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL
SEQ2155 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL
SEQ2156 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL
SEQ2157 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPTCL
SEQ2158 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL
SEQ2159 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL
SEQ2160 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPTCL
SEQ2161 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPTCL

SEQ2150 IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH
SEQ2151 IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVHIAQGEPILEH
SEQ2152 IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH
SEQ2153 IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH
SEQ2154 IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGESILEH
SEQ2155 IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGESILEH
SEQ2156 IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH
SEQ2157 IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH
SEQ2158 IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGESILEH
SEQ2159 IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH
SEQ2160 IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH
SEQ2161 IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQ

SEQ2150 RKLGLVTDIEGNQEITEVFADVEKALLELK
SEQ2151 SKLGLVTDIEGNQEI
SEQ2152 RKLGLVTDIEGNQEITEVFADVEKALLE--
SEQ2153 RKLGLVTDIEGNQEITEVFADVEKAL
SEQ2154 RKLGLVTDIEG
SEQ2155 RKLGLVTDIEG
SEQ2156
SEQ2157 RKLGLVTDIEGNQEITEVFADVEKALL
SEQ2158 RKLGLVTDIEGNQEITEVFADVEKAL
SEQ2159 KKLGLVTDIEGN
SEQ2160 RKLGLVTDIEGNQEITEVFADVEKALLELK
SEQ2161
Table 22: Comparative Sequences relating to SAG0093
(D-alanyl-D-alanine carboxypeptidase family protein) 

SEQ ID NO. 2201: SAG0093 FROM THE 090 GBS TYPE Ia STRAIN (REVERSE COMPLEMENT)

AAGCCTAAC&GTCAACAATCATCATCT CAAAAGTTGAGGAATGAGG^ 

CAATTACCAGCTGTATCATCAAAAGATTGGAACTTGATTTTGGTCAATCGTGACCA 

CCTGTTGAAAATATTTATTTGGATAAACGTATTACGAAGCZAAGCTACTCAGTTTTTAG^ 

CATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGGAGAAGTTGTTC^ 

TTGACGAGGGGAC!AAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGC^ 

ATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTA 

CGGTTTCCGGATGGTAAAAGAGGAGAAACAGGGGTAGGTTATGAAGATTGGCA 

ATGGCCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGGAGAATAACCAA 

SEQ ID NO. 2202: SAG0093 FROM THE 1169NT1 GBS TYPE V STRAIN (REVERSE COMPLEMENT)

AAGCCTAACAGTCAACAATCATCACCT CAAAAGTTGAGGAATGAGG 

CGATTACCAGCTGTATCATCAAAAGATTGGAACTTGATTTTGGTCAATCGTGACGATAAACATGAA 

CCTGTTGAAAATATTTATTTGGATAAACGTATTACGAAGC!AAGCTACTCAGTTTT^ 

CATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGGAGAAGTTGTTCAATTCTTATGTTACTCAAGA 

TTGACGAGGGGACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGC 

ATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTC^GTCAGTTGAAAA^ 

CGGTTTCCGGATGGTAAAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGTCTGG 
ATGGCCGAACATCGTTTAAGA.TTAGAAGAATACATAACTTTATTAAAGGAGAATAACCAA 

SEQ ID NO. 2203: SAG0093 FROM THE 18RS21 GBS TYPE II STRAIN

AAGCCTAACAGTCAACAATCATCATCTCAAAAGTT 
CAATTACCAGCTGTATCATCAAAAGATTGGAACTTGATTTT 

CCTGTTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTGCTAGAGCAATTGATTCACGAGAA 

CATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGX^GAAGTTGTTCAATTCTTATGTTACTCAAGA 

TTGACGAGGGGACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAGACTGGATTAG 

ATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTA 

CGGTTTCCGGATGGTAAAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAG 

ATGGCCAAACATCATTTAACATTAGAAaAATACATAACTTTATTAAAGGAGAATAACCAA 

SEQ ID NO. 2204: SAG0093 FROM THE 2603V/R GBS TYPE V STRAIN 

ACAGTCAACAATCATCATCTCAAAAGTTGAG 

CAGCTGTATCATCAAAAGATTGGAACTTGATTTTGGTCAATCGTGACC^TAAACA 
AAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCT^ 

TTTCGGGTTATCGTAGTGTTGCCTATCAGGAGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAAC 

GGGGACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTC 

CTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCIAGTCAGTTGA 

CGGATGGTAAAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGTCTGCAAAATATATG 
AACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGGAGAATAACCAAAACCCAGCTTTCTTGTAG 

SEQ ID NO. 2205: SAG0093 FROM THE A909 GBS TYPE Ia STRAIN

AAGCCTAACAGTCAACAATCATCATCTCAAAAGTTGAGGAATGAGGATATAAAAAAGACATCCTCTCAAAAAAGAAATAAGAAATTA

CGATTACCAGCTGTATGA.TCAAAAGATTGGAACTTGATTTTGGTCAATC^ 

CCTGTTGAAAATATTTATTTGX-iATAAACGTATTACGAAGCAAGCTAC^ 

CATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGGAGAAGTTGTTCAATTC^ 

TTGACGAAGGAACAAGCAGAAAAGTTGGTAAAAACTTACTCTC!AGCCTGCAGGTGCTAGTG^ 

ATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCAGTCAGTTG 

CGGTTTCCGGATGGTAAAACAGCAGAAA(ZAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGT 

ATGGCCAAAC^TCATTTAACATTAGAAGAATACATAACTTTATTAAAGGAGAATAACCAA 

SEQ ID NO. 2206: SAG0093 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

AAGCCTAACAGTCAACAATCATCATCTCAAAAGTTGAGGAATGA 

ACAATTACCAGCTGTATCATCIAAAAGATTGGAACTTGATTTTGGTCA^ 

TCCTGTTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAG 

ACATTTAATTTCGGGTTATCGTAGTGTTGCCTATC^GGAGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAA 
TTTGACGAGGGGACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCT^ 

TATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCAGTCAGTTGAAAAAGATAGCTCCACAATAT 

ACGGTTTCCGGATGGTAAAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGC^ 

TATGGCCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGGAGAATAACCAA 

SEQ ID NO. 2207: SAG0093 FROM THE COH1 GBS TYPE III STRAIN

CCTAAGAGTC!AA CAAT CAT CAT CT CAAAAGT TGAGGAATGAGGATATAAA^ 

ATTACCAGCTGTATCATCAAAAGATTGGAACTTGATTTTGGTCIAATCGTG^ 

TGTTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTC^^ 

TTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGGAGAAGTTGTTCAA^ 

GACGAGGGGACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCT 

GAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCAGTC^GTTGAA 

GTTTCCGGATGGTAAAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATT^ 

GGTCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAG^ 

SEQ ID NO. 2208: SAG0093 FROM THE H36b GBS TYPE Ib STRAIN

AAGCCTAACAGTCAACAATCATCATCTGAAAAGTTGAGGAATGAGGAT^ 

CGATTACC^GCTGTATCATCAAAAGATTGGAACTTGATTTTGGTCAATCG 

CCTGTTGAAAATATTTATTTGGATAAACGTATTACGAAGC1AAGCTACTCA 

CATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGGAGAAGTTGTTC!AATTCTTATGTTACTCAvK3A 
TTGACGAAGGAACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACC^ 

CGGTTTCCGGATGGTAAAAC^GCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTA 
ATGGCCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGGAGAATAACCAA

SEQ ID NO. 2209: SAG0093 FROM THE JM9130013 GBS TYPE VIII STRAIN

AAGCCTAACAGTGAACAATCATCATCTCAAAAGTTGAGG 

CAATTACCAGCTGTATCATCAAAAGATTGGAACTTGATTTTGGTCAATCGTGAC 
CCTGTTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGC^ 

CATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGGAGAAGTTGTTCAATTCTTATGTTACTCAAGAGA 
TTGACGAGGGGACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGC^G^ 

ATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCAGTCLAGTTGAAAAAGATAGCTCCACAATATGG 
CGGTTTCCGGATGGTAAAACAGCAGAAACAGGGGT^ 

ATGGCCAAACATCATTTAAaATTAGAAGAATA(^TAACTTTATTAAAGGAGAATAACCAA 

SEQ ID NO. 2210: SAG0093 FROM THE M732 GBS TYPE III STRAIN

AGCCTAACAGTCAACAATC^TCATCTGAAAAGTTGAGGA 

GATTACCAGCTGTATCATCAAAAGATTGGAACTTGATTTTGGTCAATCGTGACCAT^ 

CTGTTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTGCTAGAGCAATTGATTC^ 
ATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGGAGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGA 
TGACGAGGGGACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGC^ 
TGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCAGTCAGTTGAA 

GGTTTCCGGATGGTAAAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGTCTGCAAAATATA
TGGTCAAACATCATTTAAGATTAGAAGAATACAT^ 

SEQ ID NO. 2211: SAG0093 FROM THE M781 GBS TYPE III STRAIN

AAGCCTAAC!AGTCAACAATCATCATCTCAAAAGTTGAGGAATGAGG 
CGATTACCAGCTGTATCATCAAAAGATTGGAACTTGATTTTGGTCA^ 

CCTGTTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTGCTAGAGCAATTGATT 

CATTTAATTTCGGGTTATCGTAGTGTTGCCTATC^GGAGAAGTTGTTCS^ 

TTGACGAGGGGACAAGCAGAAAAGTTGGTAAAAACTTACTCTCZAGCCTGCAG^ 

ATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCAGTCAGTTGAAAAAa^TAGCTCCACAATATGGT 
CGGTTTCCGGATGGTAAAACAGCIAGAAAGAGGGGTAGGTTATGA^ 

ATGGTCAAACATCATTTAAC^TTAGAAGAATA<^TAACTTTATTAAAGGAGAATAACCM 



SEQ2201 AGCCTAACAGTCAACAATCATCATCTCAAAAGTTGAGGAATGAGGATAT^
SEQ2202 AGCCTAACAGTCAACAATCATCACCTCAAAAGTTGAGGAAT
SEQ2203 AGCCTAACAGTCAACAATCATCATCTCAAAAGTTGAGGAATGAGGATATAAAAAAGATA
SEQ2204       ACAGTCAACAATCATCATCTCAAAAGTTGAGGAATGAGGATATAAAAAAGATA
SEQ2205 AGCCTAACAGTCAACAATCATCATCTCAAAAGTTGAGGAATG^
SEQ2206 AGCCTAACAGTCAACAATCATCATCT^
SEQ2207 --CCTAACAGTCAACAATCATCATCTCAAAAGTTGAGGAATGAGGATATAAAAAAGACA
SEQ2208 AGCCTAACAGTCAACAATCATCATCTGAAAAGTTGAGGAAT
SEQ2209 AGCCTAACAGTCAACAATCATCATCTCAAAAGTTGAGGAATGAGGATATAAAAAAGATA
SEQ2210 AGCCTAACAGTCAACAATCATCATCTGAAAAGTTGAGGAATGAGG
SEQ2211 AGCCTAACAGTCAACAATCATCATCTCAAAAGTTGAGGAATGAGGATATAAAAAAG



SEQ2201 
SEQ2202 
SEQ2203 
SEQ2204 
SEQ2205 
SEQ2206 
SEQ2207 
SEQ2208 
SEQ2209 
SEQ2210 
SEQ2211 

SEQ2201
SEQ2202
SEQ2203
SEQ2204
SEQ2205
SEQ2206
SEQ2207
SEQ2208
SEQ2209
SEQ2210
SEQ2211

SEQ2201 
SEQ2202 
SEQ2203 
SEQ2204 
SEQ2205 
SEQ2206
SEQ2207
SEQ2208
SEQ2209 
SEQ2210 
SEQ2211 



TCCTCTCAAAAAAGAAAT-AAGAAATT-ACAATTACCAGCTGTATCATCAAAAGATTGGA
TCCTCTCAAAAAAGAAAT-AAGAAATT-ACGATTACCAGCTGTATCATCAAAAGATTGGA
TCCTCTCAAAAAAGAAAT-AAGAAATT-ACAATTACCAGCTGTATCATCAAAAGATTGGA
TCCTCTCAAAAAAGAAAT-AAGAAATT-ACAATTACCAGCTGTATCATCAAAAGATTGGA
TCCTCTCAAAAAAGAAAT-AAGAAATT-ACGATTACCAGCTGTATCATCAAAAGATTGGA
TCCTCTCAAAAAAGAAAT-AAGAAATTTACAATTACCAGCTGTATCATCAAAAGATTGGA
TCCTCTCAAAAAAGAAATTAAGAAATT-ACGATTACCAGCTGTATCATCAAAAGATTGGA
TCCTCTCAAAAAAGAAAT-AAGAAATT-ACGATTACCAGCTGTATCATCAAAAGATTGGA
TCCTCTCAAAAAAGAAAT-AAGAAATT-ACAATTACCAGCTGTATCATCAAAAGATTGGA
TCCTCTCAAAAAAGAAAT-AAGAAATT-ACGATTACCAGCTGTATCATCAAAAGATTGGA
TCCTCTCAAAAAAGAAAT-AAGAAATT-ACGATTACCAGCTGTATCATCAAAAGATTGGA

ACTTGATTTTGGTCAATCGTGACGATAAACATGAAGAATTAAGTCCAGATGTGGTTCCTG 
ACTTGATTTTGGTCAATCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTGCCTG 
ACTTGATTTTGGTCAATCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTTCCTG 
ACTTGATTTTGGTCAATCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTTCCTG 
ACTTGATTTTGGTCAATCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTGCCTG 
ACTTGATTTTGGTCAATCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTTCCTG 
ACTTGATTTTGGTCAATCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTGCCTG 
ACTTGATTTTGGTCAATCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTGCCTG 
ACTTGATTTTGGTCAATCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTTCCTG
ACTTGATTTTGGTCAATCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTGCCTG 
ACTTGATTTTGGTCAATCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTGCCTG 

TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 



SEQ2201 
SEQ2202 
SEQ2203 
SEQ2204 
SEQ2205
SEQ2206 
SEQ2207 
SEQ2208 
SEQ2209 
SEQ2210 
SEQ2211 

SEQ2201 
SEQ2202 
SEQ2203 
SEQ2204
SEQ2205
SEQ2206
SEQ2207
SEQ2208
SEQ2209
SEQ2210 
SEQ2211 



CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 

AGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAATTTGACGAGGG 
AGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAATTTGACGAGGG 
AGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAATTTGACGAGGG 
AGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAATTTGACGAGGG 
AGAAGTTGTTCAATTCTTATGTTACTCAAGAAATGACTAGTAACCCTAATTTGACGAAGG 
AGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAATTTGACGAGGG 
AGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAATTTGACGAGGG 
AGAAGTTGTTCAATTCTTATGTTACTCAWGAAATGACTAGTAACCCTAATTTGACGAAGG 
AGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAATTTGACGAGGG 
AGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAATTTGACGAGGG 
AGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAATTTGACGAGGG 



SEQ2201 
SEQ2202 
SEQ2203 
SEQ2204 
SEQ2205
SEQ2206
SEQ2207
SEQ2208
SEQ2209
SEQ2210 
SEQ2211 



ACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAGA
ACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAGA
ACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAGA
ACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAGA
ACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAGA
ACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAGA
ACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAGA
ACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAGA
ACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAGA
ACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAGA
ACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAGA



SEQ2201
SEQ2202 
SEQ2203 
SEQ2204 
SEQ2205 
SEQ2206 
SEQ2207 
SEQ2208
SEQ2209 
SEQ2210 
SEQ2211 



CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 



SEQ2201
SEQ2202 
SEQ2203 
SEQ2204 
SEQ2205 
SEQ2206
SEQ2207
SEQ2208
SEQ2209 
SEQ2210 
SEQ2211 



TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 



SEQ2201 
SEQ2202 
SEQ2203 
SEQ2204 
SEQ2205
SEQ2206 
SEQ2207 
SEQ2208 
SEQ2209 
SEQ2210 
SEQ2211 



AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 



SEQ2201 
SEQ2202 
SEQ2203 
SEQ2204 
SEQ2205 
SEQ2206
SEQ2207
SEQ2208
SEQ2209 
SEQ2210 
SEQ2211 



CTGCAAAATATATGGCCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGG 
CTGCAAAATATATGGCCGAACATCGTTTAACATTAGAAGAATACATAACTTTATTAAAGG
CTGCAAAATATATGGCCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGG 
CTGCAAAATATATGGCCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGG
CTGCAAAATATATGGCCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGG 
CTGCAAAATATATGGCCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGG 
CTGCAAAATATATGGTCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGG 
CTGCAAAATATATGGCCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGG
CTGCAAAATATATGGCCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGG 
CTGCAAAATATATGGTCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGG 
CTGCAAAATATATGGTCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGG
SEQ2201
SEQ2202
SEQ2203
SEQ2204
SEQ2205
SEQ2206
SEQ2207
SEQ2208
SEQ2209
SEQ2210
SEQ2211



AGAATAACCAA 

AGAATAACCAA

AGAATAACCAA 

AGAATAACCAAAACCCAGCTTTCTTGTACAA- 

AGAATAACCAA 

AGAATAACCAA 

AGAATAACCAAAACCCAGCTTTCTTGTACAA- 

AGAATAACCAA 

AGAATAACCAA 

AGAATAACCAAAACCCAGCTTTCTT 

AGAATAACCAA



>SEQ ID NO 2250: 18_090 frame: 1 

KPNSQQSSSQKLRNEDIKKISSQKRNKKLQLPAVSSKDWNLILVNRDHKHEELSPDVVPV
ENIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG
QAEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGK
TAETGVGYEDWHYRYVGVESAKYMAKHHLTLEEYITLLKENNQ

>SEQ ID NO 2251: 18_1169NT frame: 1 

KPNSQQSSPQKLRNEDIKKISSQKRNKKLRLPAVSSKDWNLILVNRDHKHEELSPDVVPV
ENIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG
QAEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGK
TAETGVGYEDWHYRYVGVESAKYMAEHRLTLEEYITLLKENNQ

>SEQ ID NO 2252: 18_18RS21 frame: 1 

KPNSQQSSSQKLRNEDIKKISSQKRNKKLQLPAVSSKDWNLILVNRDHKHEELSPDVVPV
ENIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG
QAEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGK
TAETGVGYEDWHYRYVGVESAKYMAKHHLTLEEYITLLKENNQ

>SEQ ID NO 2253: 18_2603 frame: 3 

SQQSSSQKLRNEDIKKISSQKRNKKLQLPAVSSKDWNLILVNRDHKHEELSPDVVPVENI
YLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRGQAE
KLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGKTAE
TGVGYEDWHYRYVGVESAKYMAKHHLTLEEYITLLKENNQNPAFLY

>SEQ ID NO 2254: 18_A909 frame: 1 

KPNSQQSSSQKLRNEDIKKTSSQKRNKKLRLPAVSSKDWNLILVNRDHKHEELSPDVVPV
ENIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTKE
QAEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGK
TAETGVGYEDWHYRYVGVESAKYMAKHHLTLEEYITLLKENNQ

>SEQ ID NO 2255:18_CJB110 frame: 1 

KPNSQQSSSQKLRNEDIKKISSQKRNKKFTITSCIIKRLELDFGQS 

>SEQ ID NO 2256:18_COH1 frame: 1

PNSQQSSSQKLRNEDIKKTSSQKRN 

>SEQ ID NO 2257: 18_H36B frame: 1 

KPNSQQSSSQKLRNEDIKKTSSQKRNKKLRLPAVSSKDWNLILVNRDHKHEELSPDVVPV
ENIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTXEMTSNPNLTKE
QAEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGK
TAETGVGYEDWHYRYVGVESAKYMAKHHLTLEEYITLLKENNQ

>SEQ ID NO 2258: 18_JM9130013 frame: 1 

KPNSQQSSSQKLRNEDIKKISSQKRNKKLQLPAVSSKDWNLILVNRDHKHEELSPDVVPV
ENIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG
QAEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGK
TAETGVGYEDWHYRYVGVESAKYMAKHHLTLEEYITLLKENNQ

>SEQ ID NO 2259:18_M732 frame: 3 

PNSQQSSSQKLRNEDIKKTSSQKRNKKLRLPAVSSKDWNLILVNRDHKHEELSPDVVPVE
NIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRGQ
AEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGKT
AETGVGYEDWHYRYVGVESAKYMVKHHLTLEEYITLLKENNQNPAF

>SEQ ID NO 2260: 18_M781 frame: 1 

KPNSQQSSSQKLRNEDIKKTSSQKRNKKLRLPAVSSKDWNLILVNRDHKHEELSPDVVPV
ENIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG
QAEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGK
TAETGVGYEDWHYRYVGVESAKYMVKHHLTLEEYITLLKENNQ



SEQ2250
SEQ2251
SEQ2252
SEQ2253
SEQ2254
SEQ2255
SEQ2256
SEQ2257
SEQ2258
SEQ2259
SEQ2260



PNSQQSSSQKLRNEDIKKISSQKRNKKLQLPAVSSKD
PNSQQSSPQKLRNEDIKKISSQKRNKKLRLPAVSSKDWNLILVNRDHKHEELSPDVVPV
PNSQQSSSQKLRNEDIKKISSQKRNKKLQLPAVSSKDWNLILVNRDHKHEELSPDVVPV
--SQQSSSQKLRNEDIKKISSQKRNKKLQLPAVSSKDWNLILVN^
PNSQQSSSQKLRNEDIKKTSSQKRNKKLRLPAVSSKDWNLILVNRDHKHEELSPDVVPV
PNSQQSSSQKLRNEDIKKISSQKRNKKFTITSCIIKRLELDFGQS
PNSQQSSSQKLRNEDIKKTSSQKRN
PNSQQSSSQKLRNEDIKKTSSQKRNKKLRLPAVSSKDWNLILVNRDHKHEELSPDVVPV
PNSQQSSSQKLRNEDIKKISSQKRNKKLQLPAVSSKDWNLILVNRDHKHEELSPDVVPV
PNSQQSSSQKLRNEDIKKTSSQKRNKKLRLPAVSSKDWNLILVNRDHKHEELSPDVVPV
PNSQQSSSQKLRNEDIKKTSSQKRNKKLRLPAVSSKDWNLILVNRDHKHEELSPDVVPV



SEQ2250 
SEQ2251 
SEQ2252 
SEQ2253 
SEQ2254 
SEQ2255 
SEQ2256 
SEQ2257 
SEQ2258 
SEQ2259 
SEQ2260 

SEQ2250 
SEQ2251 
SEQ2252 
SEQ2253 
SEQ2254 
SEQ2255 
SEQ2256
SEQ2257 
SEQ2258 
SEQ2259 
SEQ2260 

SEQ2250 
SEQ2251 
SEQ2252 
SEQ2253 
SEQ2254 
SEQ2255 
SEQ2256 
SEQ2257 
SEQ2258 
SEQ2259 
SEQ2260

SEQ2250 
SEQ2251 
SEQ2252 
SEQ2253 
SEQ2254 
SEQ2255
SEQ2256 
SEQ2257 
SEQ2258 
SEQ2259
SEQ2260 



NIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG
NIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG
NIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG
NIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG
NIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTKE



NIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTXEMTSNPNLTKE 
NIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG
NIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG 
NIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG

AEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGK
AEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGK
AEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGK
AEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGK
AEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGK



AEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGK
AEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGK
AEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGK
AEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGK

AETGVGYEDWHYRYVGVESAKYMAKHHLTLEEYITLLKENNQ 

AETGVGYEDWHYRYVGVESAKYMAEHRLTLEEYITLLKENNQ 

AETGVGYEDWHYRYVGVESAKYMAKHHLTLEEYITLLKENNQ 

AETGVGYEDWHYRYVGVESAKYMAKHHLTLEEYITLLKENNQNPAFLY 

AETGVGYEDWHYRYVGVESAKYMAKHHLTLEEYITLLKENNQ 

AETGVGYEDWHYRYVGVESAKYMAKHHLTLEEYITLLKENNQ---

AETGVGYEDWHYRYVGVESAKYMAKHHLTLEEYITLLKENNQ 

AETGVGYEDWHYRYVGVESAKYMVKHHLTLEEYITLLKENNQNPAF 

AETGVGYEDWHYRYVGVESAKYMVKHHLTLEEYITLLKENNQ
Table 23: Comparative Sequences relating to SAG0163
(competence protein Cgl A) 

SEQ ID NO. 2301: SAG0163 FROM THE 090 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

GGCAGTAGAAGTAAATGCTCAAGATATTTATATCATTCCCA^ 

GTTTATTGATGTTTTTGAGTTTAATAGG&TGGCTAGTCTTATTAGTCACTTTAAATTTGTG^ 

ACGAAGTGAATTAGGTTCTTGTGACTATGAACTGTCAG 

TCAAGAATCTTTAGTTATTCGTATTTTGTATT 

TAAAAATAAGCAAATTATCACGATTGAAGATCCGGTAGAAATCAAGAATGAC^ 

AATGACTTATGATGCTTTAATCAAACTGTCTTTACGGCATCGTCCAGATATTTTAATTATCGGAGAGATTAGAGATC^ 

CCGTGCTGTTATTCGTGC^GTTTAACGGGAGTGATGGTTTTTTCTACTATTC^TGCTAAAAGTATTTCCGGAGTCTATGATAGGCT 

TATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTAAAATTA^^ 

TGACTTTGAGACAGGTAACTTTAAAAAACACTaAT 

TAAGAAACAGGCACAAGTCGAAAAAATTATCCCTCAAGAAACA^ 

SEQ ID NO. 2302: SAG0163 FROM THE 1169NT1 GBS TYPE V STRAIN (REVERSE COMPLEMENT) 

GGTGATTGTTATGAAACCTCTACTATTGCGTATTTGATGATGAAAGGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTC 
TTATTAGTCACTTTAAATTTGTGGCA.^ 

AGGGAAGACTGGTTTCATTACGACTATCGAGTGTG<3GAGATTATCGTGGTC^ 

ATCAGGACTTAAAATATTGGTTTGATAATATAAAGCAAATGAAGGAAGTA 

TGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAA^^ 

AAATCAAGAATGACAAGATGTTACAACTCCAATTGAATGAGGATATTGGAATGACTT^ 

ATCGTCCAGATATTTTAATTATCGGAGAGATT^ 

TTTTTTCTACTATTCATGCTAAAAGTATTCCCGGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATA
GTCTAAAATTAATAGCATATCAACGTTT^^ 

ACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAAGGATATATCAGTAAGAAACAGGC^ 
AAACAACGGAAAGTAGTCCAACTTTT 

SEQ ID NO. 2303: SAG0163 FROM THE 18RS21 GBS TYPE II STRAIN (REVERSE COMPLEMENT) 

GTTCAATCATTAGCAAAGCAAGTCATTCATCAG^ 

GAACTCTATATGCGTATTGATGATGAAAGGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTAAA 
TTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTT^ 
TTACGACTATCGAGTGTGGGAGATTATCGTGGTCAAGAATCTTTAGTTATTCGTATTT^ 
TGGTTTGATAATATAAAGCAAATGAAGGAAGTACTGGGTATAAGAGGGCTATATCTTT^ 

ACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAAATAAGCAAATTATCACGATTGAAGATCCGGTAGAAATCAA
ATGTTACAACTCCAATTGAATGAGGATATTGGAATGACTTATGATG 

GCTAAAAGTATTCCCGGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTAAAATTAATAG 

TATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGT^ 

GTGGATATCTTGGCTGAAGAAGGACATATCAGTAAGAAACAGGCACAAGTCGAAAAAATT^ 

CCAACTTTT 

SEQ ID NO. 2304: SAG0163 FROM THE 2603 V/R GBS TYPE V STRAIN (REVERSE COMPLEMENT) 

GATATTTATATCATTCCCAAAGGTGATTGTTATGAACTCTATATGCGTATTGATGATGAAAGGCGGTTTATTGATGTTTTTGAGTTT
AATAGGATGGCTAGTCTTATTAGTCACTTTAAA 

GACTATGAACTGTCAGAGGGAAGACTGGTTTC^TTACGACTATCGAGTGTGGGAGATTATCGTGGTCMGAATCTTTAGTTATTCGT 
ATTTTGTATTCAGGTCATCAGGACTTAAAATATTGGTTTGATAATATAAAGCAAATGAAGGA^ 
CTTTTTTCCGGCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGC^ 
ATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTTACAACTCC^TTGAATGAGGATATTG 

AAACTGTCTTTACGGCATCGTCCAGATATTTTAATTATCGGAGAGATTAGAGATCAAGCGACGGCCCGTGCTGTTATTCGTG 

TTAACGGGAGTGATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCCGGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTAT 

CAAGAGTTAGAAAATAGTCTAAAATTAATAGCATATCAACGTTTA^ 

AAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGA^ 

AAAAATTATCCCTCAAGAAACAACGGAAAGTAGTCCAACTTTT 

SEQ ID NO. 2305: SAG0163 FROM THE A909 GBS TYPE Ia STRAIN (REVERSE COMPLEMENT)

GTTCAATCATTAGCAAAGCAAGTCATTCATC^ 

G^CTCTATATGCGTATTGATGATGAAAGGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTAAA 

TTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACTATGAACTGTCAGAGG 

TTACGACTATCGAGTGTGGGAGATTATCGTGGTCAAGAATCTTT^ 

TGGTTTGATAATATAAAGCAAATGAAGGAAGTACTGGGTATAAGAGGGCTATATCTTTTTTCCGGCCCTGTGGGGAGTGGTAA 
ACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAAATAAGCAAATT^
ATGTTACAACTCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATC^^ 
ATTATCGGAGAGATTAGAGATCAAGCGACGGCCCGTGCTGTTATTCGTGCAAGTT^ 

GCTAAAAGTATTCCCGGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTAAAATTAAT^ 

TATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGTAATTTTA^ 

GTGGATATCTTGGCTGAAGAAGGACATATCAGTAAGAAACAGGC^ 

CCAACTTTT 

SEQ ID NO. 2306: SAG0163 FROM THE CJB110 GBS NONTYPEABLE STRAIN (REVERSE COMPLEMENT)

GTTCAATCATTAGCAAAGCAAGTCATTCATCAG 

GAACTCTATATGCGTATTGATGATGAAAGGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTAAA 

TTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACTATGA 

TTACGACTATCGAGTGTGGGAGATTATCGTGGTCAAGAATCTTTAGTTATTCGTAT^ 

TGGTTTGATAATATAAAGCAAATGAAGGAAGTACTGGGTACAAGAGGGCTAT^ 

ACTCTGATGTATCAATTAGCTTCAGAAGTATTTAAAAATAAGCAAATTATC^ 

ATGTTACAACTCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGCAT 

ATTATCGGAGAGATTAGAGATC^GCGACGGCCCGTGCTGTTATTCGTGCAAGTTTAACGGGAGTGATGGTTTTTTCTACTATTCAT 

GCTAAAAGTATTTCCGGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTAAAATTAATAGCA

TATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGTAACTTTAAAAA^ 

GTGGATATCTTGGCTGAAGAAGGACATATCAGTAAGAAACAGGCACAAGTCGAAAAA^ 

CCAACTTTT 

SEQ ID NO. 2307: SAG0163 FROM THE COH1 GBS TYPE III STRAIN (REVERSE COMPLEMENT)

AGGTGATTGTTATGAAATTCTATATGCGTATTGATGATGAAAGGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTA 
TTAGTG&CTTTAAATTTGTCGCAGGCATGAACGTTGGA 

GAAGACTGGTTTCATTACGACTATCAAGTGTGGGAGATTATCGTGGTCAAGAATCTTTAGTTATTCGTACTTTGTATT
AGGACTTAAAATATTGGTTTGATAATATAAAGTAAATGAAGGAAGTACTGTGTGC^GAGGGCTATATCTTTTTTCCGGCCCTGTGG 
GGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAAAT^ 
TCAAGAATGACAAGATGTTACAACTCCAATTGAATGAGGATATTGGAATG^ 

GTCCAGATATTTTAATTATCGGAGAGATTAGAGATC^GCGACGGCCCGTGCTGTTATTCGTGCAAGTTTAACGGGAGTAATGGTTT 
TTTCTACTATTCATGCTAAAAGTATTCCCGGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTC
TAAAATTAATAGCATATCAACGTTTAATTGGAGGAG 

AGTGGAATAGACAAGTGGATATCTTGGCTGAAGAAGGACATATCAGTAAG 
CAACGGAAAGTAGTCCAACTTTT

SEQ ID NO. 2308: SAG0163 FROM THE H36b GBS TYPE Ib STRAIN (REVERSE COMPLEMENT)

TCATTAGCAAAGCAAGTCATTCATCAGGCAGTAGA

TATATGCGTATTGATGATGAAAGGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTAAATTTGTG 

GCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACTA 

CTATCGAGTGTGGGAGATTATCGTGGTCAAGAATCTTTAGTTATTCGTATTTTGTATT(^ 

GATAATATAAAGCAAATGAAGGAAGTACTGGGTATAAGAGGG 

ATGTATCAATTAGCTTCAGAAGTATTTAAAAATAAGCAAATTAT(^

CAACTCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTC 

GGAGAGAAATAGAGATCAAGCGACGGCCCGTGCTGTTATTCGTGCAAGTTTAACGGGAGTGATGTTTTTTTCTACTATTCATG 

AAGTATTCCCGGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGA^ 

ACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTG^GACAGGTAATTTTAAAAA 

TATCTTGGCTGAAGAAGGACATATCAGTAAGAAACAGGCACAAGTCGAAA 

TTTT 

SEQ ID NO. 2309: SAG0163 FROM THE JM9130013 GBS TYPE VIII STRAIN (REVERSE COMPLEMENT) 

GTTCAATCATTAGCAAAGCAAGTCATTCATCAGGCAGTAGAAGTA^ 
GAACTCTATATGCGTATTGATGATGAAAGGCGGTTTATTGATGTTTTTGAGTTTAATA^ 
TTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAG^ 
TTACGACTATCGAGTGTGGGAGATTATCGTGGTCAAGAATCTTTAGTTATT^ 

TGGTTTGATAATATAAAGCAAATGAAGGAAGTACTGGGTATAAGAGGGCTATATCTTTTTTCCGGCCCTGTGGGGAGTGGTAA 

ACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAAATAAGCAAATTA^ 

ATGTTAC^CTCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACT 

ATTATCGGAGAGATTAGAGATCAAGCGACGGCCCGTGCTGTTATTCGTGCAAGTTTAACGGGAGTGATGGTTTTTTCTACTATTCAT 

GCTAAAAGTATTCCCGGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTAAAATTAATAGCA 

TATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGTAATTTTAA^ 

GTGGATATCTTGGCTGAAGAAGGACATATCAGTAAGAAACAGGC^ 

CCAACTTTT 

SEQ ID NO. 2310: SAG0163 FROM THE M732 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

TGACTTGTTATGAAACTCTATATGCGTATTTGATGATGAAAAGGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTA 

TTAGTCACTTTAAATTTGTGGCAGGCATGAACGTTGGAGAAAAAAG^ 

GAAGACTGGTTTCATTACGACTATC^GTGTGGGAGATTATCGTGGTCAAGAATCT 

AGGACTTAAAATATTGGTTTGATAATATAAAGTAAATGAAGGAAGTACTGTGTGCAAGAGGGCTATATCTTTTTTCCGGCCCTGTGG 

GGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAAAT^ 

TCAAGAATGACAAGATGTTACAACTCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAAT 

GTCCAGATATTTTAATTATCGGAGAGATTAGAGATCAAGCGACGGCCCGTGCTGTTATTCGTGCAAGTTTAACGGGAGTAATGGTTT 

TTTCTACTATTCATGCTAAAAGTATTCCCGGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGT^ 

TAAAATTAATAGCATATCAACGTTTA&TTG 

AGTGGAATAGACAAGTGGATATCTTGGCTGAA

CAACGGAAAGTAGTCCAACTTTT 

SEQ ID NO. 2311: SAG0163 FROM THE M781 GBS TYPE III STRAIN (REVERSE COMPLEMENT)

C^GTAGAAGTAAATGCTCAAGATATTTATAT 

TTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTC^CTTTAAATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGAC 

GAAGTCAATTAGGTTCTTGTGACTATGAACTGTCAGAGGGAAGACTGGTTTCATT^ 

AAGAATCTTTAGTTATTCGTACTTTGTATTGAGGTCATCAGGACTTAAA^ 

TGTGTGCAAGAGGGCTATATCTTTTTTCCGGCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTA 
AAAATAAGCAAATTATCACGATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTTACA^ 

TGACTTATGATGCTTTAATCAAACTGTCTTTACGGCATCGTCCAGATATTTTAATTATCGGAGAGATTAGAGATCAAGCGACGGCCC 

GTGCTGTTATTCGTGCAAGTTTAACGGGAGTAATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCCGGAGTCTATGATAGGCTTA 

TAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTAAAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTG 

ACTTTGAGACAAGTAACTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCT 

AGAAACAGGCACAAGTCGAAAAAATTATCCCTCAAGAAAC^CGGAAAGTAGTCCAACTTTT 

SEQ2301 GGCAGTAGAAGTAAATGCTCAAGATATT

SEQ2302

SEQ2303 TTCAATCATTAGCAAAGCAAGTCATTCATCAGGCAGTAGAAGTAAATGCTCAAGATATT

SEQ2304 GATATT

SEQ2305 TTCAATCATTAGCAAAGCAAGTCATTCATCAGGCAGTAGAAGTAAATGCTCAAGATATT

SEQ2306 TTCAATCATTAGCAAAGCAAGTCATTCATCAGGCAGT^

SEQ2307

SEQ2308 TCATTAGCAAAGCAAGTCATTCATCAGGCAGTAGAAGTAAATGCTCAAGATATT

SEQ2309 TTCAATCATTAGCAAAGCAAGTCATTCATCAGGC^

SEQ2310

SEQ2311 CAGTAGAAGTAAATGCTCAAGATATT

SEQ2301 ATATCATTCCCAAAGGTGA - TTGTTATGAA - CTCTATA TGCGTATT - GATGATGA

SEQ2302 GGTGA - TTGTTATGAA -ACCTCTACTATTGCGTATTTGATGATGA

SEQ2303 ATATCATTCCCAAAGGTGA - TTGTTATGAA - CTCTATA TGCGTATT - GATGATGA

SEQ2304 ATATCATTCCCAAAGGTGA - TTGTTATGAA - CTCTATA TGCGTATT - GATGATGA

SEQ2305 ATATCATTCCCAAAGGTGA- TTGTTATGAA - CTCTATA TGCGTATT - GATGATGA

SEQ2306 ATATCATTCCCAAAGGTGA -TTGTTATGAA -CTCTATA TGCGTATT -GATGATGA

SEQ2307 -AGGTGA- TTGTTATGAAATTCTATA TGCGTATT - GATGATGA

SEQ2308 ATATCATTCCCAAAGGTGA- TTGTTATGAA- CTCTATA TGCGTATT- GATGATGA

SEQ2309 ATATCATTCCCAAAGGTGA - TTGTTATGAA - CTCTATA TGCGTATT- GATGATGA

SEQ2310 TGACTTGTTATGAAACTCTATA TGCGTATTTGATGATGA

SEQ2311 ATATCATTCCCAAAGGTGA - TTGTTATGAA - TTCTATA TGCGTATT - GATGATGA

SEQ2301 AA-GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA

SEQ2302 AA-GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA

SEQ2303 AA-GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA

SEQ2304 AA-GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA

SEQ2305 AA-GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA

SEQ2306 AA-GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA

SEQ2307 AA-GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA

SEQ2308 AA-GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA

SEQ2309 AA-GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA

SEQ2310 AAAGGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA

SEQ2311 AA-GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA

SEQ2301 AATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT

SEQ2302 AATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT

SEQ2303 AATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT

SEQ2304 AATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT

SEQ2305 AATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT

SEQ2306 AATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT

SEQ2307 AATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT

SEQ2308 AATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT

SEQ2309 AATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT

SEQ2310 AATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT

SEQ2311 AATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT



SEQ2301 ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCGAGTGTGGGAGATTATCGTG
SEQ2302 ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCGAGTGTGGGAGATTATCGTG
SEQ2303 ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCGAGTGTGGGAGATTATCGTG
SEQ2304 ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCGAGTGTGGGAGATTATCGTG
SEQ2305 ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCGAGTGTGGGAGATTATCGTG
SEQ2306 ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCGAGTGTGGGAGATTATCGTG
SEQ2307 ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCAAGTGTGGGAGATTATCGTG
SEQ2308 ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCGAGTGTGGGAGATTATCGTG
SEQ2309 ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCGAGTGTGGGAGATTATCGTG
SEQ2310 ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCAAGTGTGGGAGATTATCGTG
SEQ2311 ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCAAGTGTGGGAGATTATCGTG

SEQ2301 GTCAAGAATCTTTAGTTATTCGTATTTTGTATTCAGGTCATCAGGACTTAAAATATTGGT
SEQ2302 GTCAAGAATCTTTAGTTATTCGTATTTTGTATTCAGGTCATCAGGACTTAAAATATTGGT
SEQ2303 GTCAAGAATCTTTAGTTATTCGTATTTTGTATTCAGGTCATCAGGACTTAAAATATTGGT
SEQ2304 GTCAAGAATCTTTAGTTATTCGTATTTTGTATTCAGGTCATCAGGACTTAAAATATTGGT
SEQ2305 GTCAAGAATCTTTAGTTATTCGTATTTTGTATTCAGGTGATCAGGACTTAAAATATTGGT
SEQ2306 GTCAAGAATCTTTAGTTATTCGTATTTTGTATTCAGGTCATCAGGACTTAAAATATTGGT
SEQ2307 GTCAAGAATCTTTAGTTATTCGTACTTTGTATTCAGGTCATCAGGACTTAAAATATTGGT
SEQ2308 GTCAAGAATCTTTAGTTATTCGTATTTTGTATTCAGGTCATCAGGACTTAAAATATTGGT
SEQ2309 GTCAAGAATCTTTAGTTATTCGTATTTTGTATTCAGGTCATCAGGACTTAAAATATTGGT
SEQ2310 GTCAAGAATCTTTAGTTATTCGTACTTTGTATTCAGGTCATCAGGACTTAAAATATTGGT
SEQ2311 GTCAAGAATCTTTAGTTATTCGTACTTTGTATTCAGGTCATCAGGACTTAAAATATTGGT



SEQ2301 TTGATAATATAAAGCAAATGAAGGAAGTACTGGGTACAAGAGGGCTATATCTTTTTTCCG
SEQ2302 TTGATAATATAAAGCAAATGAAGGAAGTACTGGGTACAAGAGGGCTATATCTTTTTTCCG
SEQ2303 TTGATAATATAAAGCAAATGAAGGAAGTACTGGGTATAAGAGGGCTATATCTTTTTTCCG
SEQ2304 TTGATAATATAAAGCAAATGAAGGAAGTACTGGGTATAAGAGGGCTATATCTTTTTTCCG
SEQ2305 TTGATAATATAAAGCAAATGAAGGAAGTACTGGGTATAAGAGGGCTATATCTTTTTTCCG
SEQ2306 TTGATAATATAAAGCAAATGAAGGAAGTACTGGGTACAAGAGGGCTATATCTTTTTTCCG
SEQ2307 TTGATAATATAAAGTAAATGAAGGAAGTACTGTGTGCAAGAGGGCTATATCTTTTTTCCG
SEQ2308 TTGATAATATAAAGCAAATGAAGGAAGTACTGGGTATAAGAGGGCTATATCTTTTTTCCG
SEQ2309 TTGATAATATAAAGCAAATGAAGGAAGTACTGGGTATAAGAGGGCTATATCTTTTTTCCG
SEQ2310 TTGATAATATAAAGTAAATGAAGGAAGTACTGTGTGCAAGAGGGCTATATCTTTTTTCCG
SEQ2311 TTGATAATATAAAGCAAATGAAGGAAGTACTGTGTGCAAGAGGGCTATATCTTTTTTCCG



SEQ2301 
SEQ2302
SEQ2303
SEQ2304
SEQ2305
SEQ2306
SEQ2307
SEQ2308
SEQ2309 
SEQ2310 
SEQ2311 



GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 
GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 
GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 



GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 
GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 
GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 
GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 
GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 
GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 
GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 



SEQ2301 ATAAGCAAATTATCACGATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTTACAAC
SEQ2302 ATAAGCAAATTATCACGATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTTACAAC
SEQ2303 ATAAGCAAATTATCACGATTGAAGATCCGGTAGAAATCAAGAATGACAA^
SEQ2304 ATAAGCAAATTATCACGATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTTACAAC
SEQ2305 ATAAGCAAATTATCACGATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTTACAAC
SEQ2306 ATAAGCAAATTATCACGATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTTACAAC
SEQ2307 ATAAGCAAATTATCACGATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTTACAAC
SEQ2308 ATAAGCAAATTATCACGATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTTACAAC
SEQ2309 ATAAGCAAATTATCACGATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTTACAAC
SEQ2310 ATAAGCAAATTATCACGATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTTACAAC
SEQ2311 ATAAGCAAATTATCACGATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTTACAAC



SEQ2301 
SEQ2302 
SEQ2303 
SEQ2304
SEQ2305
SEQ2306
SEQ2307
SEQ2308
SEQ2309 
SEQ2310 
SEQ2311 



TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 
TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 
TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 
TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 
TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 
TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 
TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 
TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 
TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 



TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 



SEQ2301 ATCGTCCAGATATTTTAATTATCGGAGAGAT-TAGAGATCAAGCGACGGCCCGTGCTGTT
SEQ2302 ATCGTCCAGATATTTTAATTATCGGAGAGAT-TAGAGATCAAGCGACGGCTCGTGCTGTT
SEQ2303 ATCGTCCAGATATTTTAATTATCGGAGAGAT-TAGAGATCAAGCGACGGCCCGTGCTGTT
SEQ2304 ATCGTCCAGATATTTTAATTATCGGAGAGAT-TAGAGATCAAGCGACGGCCCGTGCTGTT
SEQ2305 ATCGTCCAGATATTTTAATTATCGGAGAGAT-TAGAGATCAAGCGACGGCCCGTGCTGTT
SEQ2306 ATCGTCCAGATATTTTAATTATCGGAGAGAT-TAGAGATCAAGCGACGGCCCGTGCTGTT
SEQ2307 ATCGTCCAGATATTTTAATTATCGGAGAGAT-TAGAGATCAAGCGACGGCCCGTGCTGTT
SEQ2308 ATCGTCCAGATATTTTAATTATCGGAGAGAAATAGAGATCAAGCGACGGCCCGTGCTGTT
SEQ2309 ATCGTCCAGATATTTTAATTATCGGAGAGAT-TAGAGATCAAGCGACGGCCCGTGCTGTT
SEQ2310 ATCGTCCAGATATTTTAATTATCGGAGAGAT-TAGAGATCAAGCGACGGCCCGTGCTGTT
SEQ2311 ATCGTCCAGATATTTTAATTATCGGAGAGAT-TAGAGATCAAGCGACGGCCCGTGCTGTT



SEQ2301
SEQ2302
SEQ2303
SEQ2304
SEQ2305
SEQ2306
SEQ2307
SEQ2308
SEQ2309
SEQ2310 
SEQ2311 



ATTCGTGCAAGTTTAACGGGAGTGATGGTTTTTTCTACTATTCATGCTAAAAGTATTTCC 
ATTCGTGCAAGTTTAACGGGAGTGATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCC 
ATTCGTGCAAGTTTAACGGGAGTGATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCC 
ATTCGTGCAAGTTTAACGGGAGTGATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCC 
ATTCGTGCAAGTTTAACGGGAGTGATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCC 

ATTCGTGCAAGTTTAACGGGAGTAATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCC 
ATTCGTGCAAGTTTAACGGGAGTGATGTTTTTTTCTACTATTCATGCTAAAAGTATTCCC 
ATTCGTGCAAGTTTAACGGGAGTGATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCC 
ATTCGTGCAAGTTTAACGGGAGTAATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCC 
ATTCGTGCAAGTTTAACGGGAGTAATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCC 



SEQ2301 GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA
SEQ2302 GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA
SEQ2303 GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA
SEQ2304 GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA
SEQ2305 GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA
SEQ2306 GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA
SEQ2307 GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA
SEQ2308 GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA
SEQ2309 GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA
SEQ2310 GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA
SEQ2311 GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA



SEQ2301 AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGT
SEQ2302 AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAAGT
SEQ2303 AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGT
SEQ2304 AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGT
SEQ2305 AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGT
SEQ2306 AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGT
SEQ2307 AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAAGT
SEQ2308 AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGT
SEQ2309 AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGT
SEQ2310 AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAAGT
SEQ2311 AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAAGT

SEQ2301 AACTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA
SEQ2302 AACTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA
SEQ2303 AATTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA
SEQ2304 AATTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA
SEQ2305 AATTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA
SEQ2306 AACTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA
SEQ2307 AACTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA
SEQ2308 AATTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA
SEQ2309 AATTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA
SEQ2310 AACTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA
SEQ2311 AACTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA



SEQ2301 GGACATATCAGTAAGAAACAGGCACAAGT-CGAAAAAATTATCCCTCAAGAAACAACGGA
SEQ2302 GGATATATCAGTAAGAAACAGGCACAAGT-CGAAAAAATTATCCCTCAAGAAACAACGGA
SEQ2303 GGACATATCAGTAAGAAACAGGCACAAGT-CGAAAAAATTATCCCTCAAGAAACAACGGA
SEQ2304 GGACATATCAGTAAGAAACAGGCIAGAAGTGCGAAA
SEQ2305 GGACATATCAGTAAGAAACAGGCACAAGT-CGAAAAAATTATCCCTCAAGAAACAACGGA
SEQ2306 GGACATATCAGTAAGAAACAGGCACAAGT-CGAAAAAATTATCCCTCAAGAAACAACGGA
SEQ2307 GGACATATCAGTAAGAAACAGGCACAAGT-CGAAAAAATTATCCCTCAAGAAACAACGGA
SEQ2308 GGACATATCAGTAAGAAACAGGCACAAGT-CGAAAAAATTATCCCTCAAGAAACAACGGA
SEQ2309 GGACATATCAGTAAGAAACAGGCACAAGT-CGAAAAAATTATCCCTCAAGAAACAACGGA
SEQ2310 GGACATATCAGTAAGAAACAGGCACAAGT-CGAAAAAATTATCCCTCAAGAAACAACGGA
SEQ2311 GGACATATCAGTAAGAAACAGGCACAAGT-CGAAAAAATTATCCCTCAAGAAACAACGGA

SEQ2301 AAGTAGTCCAACTTTT
SEQ2302 AAGTAGTCCAACTTTT
SEQ2303 AAGTAGTCCAACTTTT
SEQ2304 AAGTAGTCCAACTTTT
SEQ2305 AAGTAGTCCAACTTTT
SEQ2306 AAGTAGTCCAACTTTT
SEQ2307 AAGTAGTCCAACTTTT
SEQ2308 AAGTAGTCCAACTTTT
SEQ2309 AAGTAGTCCAACTTTT
SEQ2310 AAGTAGTCCAACTTTT
SEQ2311 AAGTAGTCCAACTTTT



>SEQ ID NO 2350:63_090 frame: 2 

AVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFVAGMNVGEKRRS 
QLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDNIKQMKEVLGTR 
GLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQLNEDIGMTYDAL
IKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSISGVYDRLIELGVNYQ
ELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHISKKQAQVEKII
PQETTESSPTF 

>SEQ ID NO 2351: 63_1169NT frame: 3 

. LL .NLYYCVFDDERRFIDVFEFNRMASLISHFKFVAGMNVGEKRRSQLGSCDYELSEGR 
LVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDNIKQMKEVLGTRGLYLFSGPVGSGK
TTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQLNEDIGMTYDALIKLSLRHRPDILI
IGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVYDRLIELGVNYQELENSLKLIAYQR
LIGGGSLIDFETSNFKKHSSDKWNRQVDILAEEGYISKKQAQVEKIIPQETTESSPTF 

>SEQ ID NO 2352:63_18RS21 frame: 1 

VQSLAKQVIHQAVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV
AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN 
IKQMKEVLGIRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL 
NEDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVY
DRLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI
SKKQAQVEKIIPQETTESSPTF

>SEQ ID NO 2353: 63_2603 frame: 1

DIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFVAGMNVGEKRRSQLGSCDY
ELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDNIKQMKEVLGIRGLYLFSG 
PVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQLNEDIGMTYDALIKLSLRH

RPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVYDRLIELGVNYQELENSLK 

LIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHISKKQAQWKNY 

.SNF 

>SEQ ID NO 2354:63_A909 frame: 1 

VQSLAKQVIHQAVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV
AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN
IKQMKEVLGIRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL
NEDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVY 
DRLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI 
SKKQAQVEKIIPQETTESSPTF

>SEQ ID NO 2355:63_CJB110 frame: 1 

VQSLAKQVIHQAVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV
AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN
IKQMKEVLGTRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL
NEDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSISGVY
DRLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI
SKKQAQVEKIIPQETTESSPTF

>SEQ ID NO 2356:63_CJB110 frame: 1 

VQSLAKQVIHQAVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV

AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN

IKQMKEVLGTRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL

NEDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSISGVY 

DRLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI 

SKKQAQVEKIIPQETTESSPTF

>SEQ ID NO 2357: 63_H36B frame: 1 

SLAKQVIHQAVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFVAG 
MNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDNIK
QMKEVLGIRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQLNE 
DIGMTYDALIKLSLRHRPDILIIGEK 

>SEQ ID NO 2358:63_JM9130013 frame: 1

VQSLAKQVIHQAVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV
AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN
IKQMKEVLGIRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL
NEDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVY
DRLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI
SKKQAQVEKIIPQETTESSPTF

>SEQ ID NO 2359:63_M732 frame: 3 

TCYETLYAYLMMKRRFIDVFEFNRMASLISHFKFVAGMNVGEKRRSQLGSCDYELSEGRL
VSLRLSSVGDYRGQESLVIRTLYSGHQDLKYWFDNIK.MKEVLCARGLYLFSGPVGSGKT
TLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQLNEDIGMTYDALIKLSLRHRPDILII
GEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVYDRLIELGVNYQELENSLKLIAYQRL
IGGGSLIDFETSNFKKHSSDKWNRQVDILAEEGHISKKQAQVEKIIPQETTESSPTF 

>SEQ ID NO 2360:63_M781 frame: 3

VEVNAQDIYIIPKGDCYEFYMRIDDERRFIDVFEFNRMASLISHFKFVAGMNVGEKRRSQ
LGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRTLYSGHQDLKYWFDNIKQMKEVLCARG
LYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQLNEDIGMTYDALI
KLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVYDRLIELGVNYQE
LENSLKLIAYQRLIGGGSLIDFETSNFKKHSSDKWNRQVDILAEEGHISKKQAQVEKIIP 
QETTESSPTF 

>SEQ ID NO 2361:63_COH1 frame: 3

VIVMKFYMRIDDERRFIDVFEFNRMASLISHFKFVAGMNVGEKRRSQLGSCDYELSEGRL
VSLRLSSVGDYRGQESLVIRTLYSGHQDLKYWFDNIK
SEQ2350 AVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV

SEQ2351 LLNLYYCVFDDERRFIDVFEFNRMASLISHFKFV

SEQ2352 QSLAKQVIHQAVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV

SEQ2353 -DIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV

SEQ2354 QSLAKQVIHQAVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV

SEQ2355 QSLAKQVIHQAVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV

SEQ2356 QSLAKQVIHQAVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV

SEQ2357 -SLAKQVIHQAVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV

SEQ2358 QSLAKQVIHQAVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV

SEQ2359 TCYETLYAYLMMKRRFIDVFEFNRMASLISHFKFV

SEQ2360 VEVNAQDIYIIPKGDCYEFYMRIDDERRFIDVFEFNRMASLISHFKFV

SEQ2361 .VIVMKFYMRIDDERRFIDVFEFNRMASLISHFKFV

SEQ2350 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN

SEQ2351 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN

SEQ2352 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN

SEQ2353 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN

SEQ2354 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN

SEQ2355 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN

SEQ2356 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN

SEQ2357 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN

SEQ2358 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN

SEQ2359 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRTLYSGHQDLKYWFDN

SEQ2360 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRTLYSGHQDLKYWFDN

SEQ2361 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRTLYSGHQDLKYWFDN

SEQ2350 IKQMKEVLGTRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL

SEQ2351 IKQMKEVLGTRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL

SEQ2352 IKQMKEVLGIRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL

SEQ2353 IKQMKEVLGIRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL

SEQ2354 IKQMKEVLGIRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL

SEQ2355 IKQMKEVLGTRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL

SEQ2356 IKQMKEVLGTRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL

SEQ2357 IKQMKEVLGIRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL

SEQ2358 IKQMKEVLGIRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL

SEQ2359 IK-MKEVLCARGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL

SEQ2360 IKQMKEVLCARGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL

SEQ2361 IK

SEQ2350 EDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSISGVY 

SEQ2351 EDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVY 

SEQ2352 EDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVY 

SEQ2353 EDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVY 

SEQ2354 EDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVY 

SEQ2355 EDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSISGVY 

SEQ2356 EDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSISGVY

SEQ2357 EDIGMTYDALIKLSLRHRPDILIIGEK 

SEQ2358 EDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVY 

SEQ2359 EDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVY 

SEQ2360 EDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVY 

SEQ2361 

SEQ2350 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI 

SEQ2351 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETSNFKKHSSDKWNRQVDILAEEGYI

SEQ2352 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI 

SEQ2353 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI 

SEQ2354 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI 

SEQ2355 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI 

SEQ2356 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI

SEQ2357 

SEQ2358 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI 

SEQ2359 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETSNFKKHSSDKWNRQVDILAEEGHI

SEQ2360 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETSNFKKHSSDKWNRQVDILAEEGHI

SEQ2361
SEQ2350 KKQAQVEKIIPQETTESSPTF

SEQ2351 KKQAQVEKIIPQETTESSPTF

SEQ2352 KKQAQVEKIIPQETTESSPTF

SEQ2353 KKQAQVRKNYPSRNNGKSNF-

SEQ2354 KKQAQVEKIIPQETTESSPTF

SEQ2355 KKQAQVEKIIPQETTESSPTF

SEQ2356 KKQAQVEKIIPQETTESSPTF

SEQ2357

SEQ2358 KKQAQVEKIIPQETTESSPTF

SEQ2359 KKQAQVEKIIPQETTESSPTF

SEQ2360 KKQAQVEKIIPQETTESSPTF

SEQ2361
Figure 24: Comparative Sequences relating to SAG0290
(ABC transporter, substrate-binding protein) 

SEQ ID NO. 2401: SAG0290 FROM THE 1169NT1 GBS TYPE V STRAIN (REVERSE 
COMPLEMENT) 

GTATGAGTTCAGGCGTGAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACC^ 

TCAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCTGTTTTTAAAGGTAGTAAGTA^ 

AAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCAACAGGTATTGATGCAGGGAAATTTGATTTATCA

GCTAATGATTTTTCATACAATAAAGAAAGAGCAGAAAAATATCTCTTCTCAGACCCTATATCCCGTTCAAA 

TTATGCCGTAGTAGGGAAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAA^ 

TTTTATCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAA^ 

AAAATCAAATATGTTTCTGGKGACAACTGGTGTTACTAGCAGATTAAAAAAT^ 

CTTTATCCTATATGATGCCATTTCATCTGACTATATTGTAAAAGATCAATCATTAAACTTAAGCGTTTCTC 
CTTTGAAAGGTAAAATTGGTAATAATAAGGATGGATTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGT 
AAAACTCTACAGAAATTTATAAATAAGCGTATTAAAGTTTTGAAAGAAGATGGTACTTTGGCACGTTTAAG 
TAAACAATATTTCGGTGGAGATTACGTTTCAAACATTGATAAA 

SEQ ID NO. 2402: SAG0290 FROM THE 18RS21 GBS TYPE II STRAIN (REVERSE 
COMPLEMENT) 

GTATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCACCATTTACTTA
TRAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCTGTTTTTAAAGGTAGTAAGTACA 
AAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCAACAGGTATTGATGCAGGGAAATTTGATTTATCA 
GCTAATGATTTTTGATACAATAAAGAAAGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAA 
TTATGCCGTAGTAGGGAAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAG
TTTTATCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAACCAATA 
AAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATTGAGAGTGGGAAAATTGA 
CTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAAGACCAATCATTAAACTTAAGCGTTTCTC 
CTTTGAAAGGTAAAATTGGTAATAATAAGGATGGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGT 
AAAACTCTACAGAAATTTATAAATAAGCGTATTAAAGTTTTGAAAGAAAATGGTACTTTGGCACGTTTAAG 
TAAACAATATTTCGGTGGAGATTACGTTTCAAACATTGATAAA 

SEQ ID NO. 2403: SAG0290 FROM THE 2603 V/R GBS TYPE V STRAIN (REVERSE 
COMPLEMENT) 

ATTCAAAGGTTATGATGTTGATGTTGTCAAAGCTGTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGA 
CAGTTCCTTTTGATACTATTTCAACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCA 
TACAATAAAGAAAGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAATTATGCCGTAGTAGG
GAAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAGTTTTATCTGGCGTTA 
ACTATGCAC^GGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAACCAATAAAAATCAAATATG 
TCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATTGAGAGTGGGAAAATTGACTTTATCCTATATGA 
TGCCATTTCATCCGACTATATTGTAAAAGACCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAA 
TTGGTAATAATAAGGATGGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAG

SEQ ID NO. 2404: SAG0290 FROM THE 090 GBS TYPE Ia STRAIN (REVERSE COMPLEMENT)

GTATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCACCATTTACTTA 
TCAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCTGTTTTTAAAGGTAGTAAGTACA 
AAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCAACAGGTATTGATGCAGGGAAATTTGATTTATCA 
GCTAATGATTTTTCATACAATAAAGAAAGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAA 
TTATGCCGTAGTAGGGAAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAG 
TTTTATCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAACCAATA 
AAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATTGAGAGTGGGAAAATTGA 
CTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAAGACCAATCATTAAACTTAAGCGTTTCTCCT 
TTGAAAGGTAAAATTGGTAATAATAAGGATGGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTA
CAGAAATTTATAAATAAGCGTATTAAAGTTTTGAAAGAAAATGGTACTTTGGCACGTTTAAGTAAACAATATTTCGGT
GGAGATTACGTTTCAAACATTGATAAA

Figure 24: Comparative Sequences relating to SAG0290
(ABC transporter, substrate-binding protein) 

SEQ ID NO. 2405: SAG0290 FROM THE A909 GBS TYPE Ia STRAIN (REVERSE
COMPLEMENT)

GTATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCACCATTTACTTA
TCAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCTGTTTTTAAAGGTAGTAAGTACA
AAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCAACAGGTATTGATGCAGGGAAATTTGATTTATCA
GCTAATGATTTTTCATACAATAAAGAAAGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAA
TTATGCCGTAGTAGGGAAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAG
TTTTATCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATNNTAATAAAAAACCAATA
AAAATNAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATTGAGAGTGGGAAAATTGA
CTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAAGACCAATCATTAAACTTAAGCGTTTCTC
CTTTGAAAGGTAAAATTGGTAATAATAAGGATGGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGT
AAAACTCTACAGAAATTTATAAATAAGCGT

SEQ ID NO. 2406: SAG0290 FROM THE CJB110 GBS NONTYPEABLE STRAIN (REVERSE 
COMPLEMENT) 

GTATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCACCATTTACTTA 
TCAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCTGTTTTTAAAGGTAGTAAGTACA 
AAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCAACAGGTATTGATGCAGGGAAATTTGATTTATCA 
GCTAATGATTTTTCATACAATAAAGAAAGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAA 
TTATGCCGTAGTAGGGAAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAG
TTTTATCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAACCAATA
AAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATTGAGAGTGGGAAAATTGA
CTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAAGACCAATCATTAAACTTAAGCGTTTCTC 
CTTTGAAAGGTAAAATTGGTAATAATAAGGATGGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGT 
AAAACTCTACAGAAATTTATAAATAAGCGTATTAAAGTTTTGAAAGAAAATGGTACTTTGGCACGTTTAAG 
TAAACAATATTTCGGTGGAGATTACGTTTCAAACATTGATAAA 

SEQ ID NO. 2407: SAG0290 FROM THE COH1 GBS TYPE III STRAIN (REVERSE 
COMPLEMENT) 

GTATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCACCATTTACTTA 
TCAAAAAGACGGGAAATTCAAAGGTTATGACGTTGATGTTGTCAAAGCTGTTTTTAAAGGTAGTAAGTACA 
AAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCAACAGGTATTGATGCAGGGAAATTTGATTTATCA 
GCTAATGATTTTTCATATAATAAAGAAAGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAA 
TTATGCCGTAGTAGGGAAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACAGAAG
TTTTATCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAACCAATA
AAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATTGAGAGTGGAAAAATTGA
CTTTATCCTATATGATGCCATTTCATCTGACTATATTGTAAAAGATCAATCATTAAACTTAAGCGTTTCTC 
CTTTGAAAGGTAAAATTGGTAATAATAAGGATGGATTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGT 
AAAACTCTACAGAAATTTATAAATAAGCGTATTAAAGTTTTGAAAGAAGATGGTACTTTGGCACGTTTAAG
TAAACAATATTTCGGTGGAGATTACGTTTCAAACATTGATAAA 

SEQ ID NO. 2408: SAG0290 FROM THE H36b GBS TYPE Ib STRAIN
(REVERSE COMPLEMENT)

GTATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCACCATTTACTTA
TCAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCTGTTTTTAAAGGTAGTAAGTACA
AAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCAACAGGTATTGATGCAGGGAAATTTGATTTATCA
GCTAATGATTTTTCATACAATAAAGAAAGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAA
TTATGCCGTAGTAGGGAAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAG
TTTTATCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAACCAATA
AAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATTGAGAGTGGGAAAATTGA
CTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAAGACCAATCATTAAACTTAAGCGTTTCTC
CTTTGAAAGGTAAAATTGGTAATAATAAGGATGGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGT
AAAACTCTACAGAAATTTATAAATAAGCGTATTAAAGTTTTGAAAGAAAATGGTACTTTGGCACGTTTAAG
TAAACAATATTTCGGTGGAGATTACGTTTCAAACATTGATAAA

SEQ ID NO. 2409: SAG0290 FROM THE JM9130013 GBS STRAIN VIII (REVERSE
COMPLEMENT)

GTATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCACCATTTACTTA
TCAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCTGTTTTTAAAGGTAGTAAGTACA
AAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCAACAGGTATTGATGCAGGGAAATTTGATTTATCA
GCTAATGATTTTTCATACAATAAAGAAAGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAA
TTATGCCGTAGTAGGGAAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAG
TTTTATCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAACCAATA
AAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATTGAGAGTGGGAAAATTGA
CTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAAGACCAATCATTAAACTTAAGCGTTTCTC
CTTTGAAAGGTAAAATTGGTAATAATAAGGATGGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGT
AAAACTCTACAGAAATTTATAAATAAGCGTAATAAAGTTTTGAAAGAAAATGGTA

SEQ ID NO. 2410: SAG0290 FROM THE M732 GBS TYPE III STRAIN (REVERSE 
COMPLEMENT) 

GTATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCACCATTTACTTA
TCAAAAAGACGGGAAATTCAAAGGTTATGACGTTGATGTTGTGAAAGCTGTTTTTAAAGGTAGTAAGTACA
AAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCAACAGGTATTGATGCAGGGAAATTTGATTTATCA 
GCTAATGATTTTTCATATAATAAAGAAAGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAA 
TTATGCCGTAGTAGGGAAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACAGAAG 
TTTTATCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAACCAATA
AAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATTGAGAGTGGAAAAATTGA 
CTTTATCCTATATGATGCCATTTCATCTGACTATATTGTAAAAGATCAATCATTAAACTTAAGCGTTTCTC 
CTTTGAAAGGTAAAATTGGTAATAATAAGGATGGATTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGT 
AAAACTCTACAGAAATTTATAAATAAGCGTATTAAAGTTTTGAAAGAAGATGGTACTTTGGCACGTTTAAG 
TAAACAATATTTCGGTGGAGATTACGTTTCAAACATTGATAAA 

SEQ ID NO. 2411: SAG0290 FROM THE M781 GBS TYPE III STRAIN (REVERSE 
COMPLEMENT)

GTATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCACCATTTACTTA 
TCAAAAAGACGGGAAATTCAAAGGTTATGACGTTGATGTTGTCAAAGCTGTTTTTAAAGGTAGTAAGTACA
AAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCAACAGGTATTGATGCAGGGAAATTTGATTTATCA 
GCTAATGATTTTTCATATAATAAAGAAAGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAA 
TTATGCCGTAGTAGGGAAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACAGAAG
TTTTATCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAACCAATA
AAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATTGAGAGTGGAAAAATTGA 
CTTTATCCTATATGATGCCATTTCATCTGACTATATTGTAAAAGATCAATCATTAAACTTAAGCGTTTCTC 
CTTTGAAAGGTAAAATTGGTAATAATAAGGATGGATTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGT 
AAAACTCTACAGAAATTTATAAATAAGCGTATTAAAGTTTTGAAAGAAGATGGTACTTTGGCACGTTTAAG 
TAAACAATATTTCGGTGGAGATTACGTTTCAAACATTGATAAA 

SEQ2401 TATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCA
SEQ2402 TATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCA
SEQ2403
SEQ2404 TATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCA
SEQ2405 TATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCA
SEQ2406 TATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCA
SEQ2407 TATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCA
SEQ2408 TATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCA
SEQ2409 TATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCA
SEQ2410 TATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCA
SEQ2411 TATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCA

SEQ2401 CATTTACTTATCAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCT
SEQ2402 CATTTACTTATRAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCT
SEQ2403                          ATTCAAAGGTTATGATGTTGATGTTGTCAAAGCT
SEQ2404 CATTTACTTATCAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCT
SEQ2405 CATTTACTTATCAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCT
SEQ2406 CATTTACTTATCAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCT
SEQ2407 CATTTACTTATCAAAAAGACGGGAAATTCAAAGGTTATGACGTTGATGTTGTCAAAGCT
SEQ2408 CATTTACTTATCAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCT
SEQ2409 CATTTACTTATCAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCT
SEQ2410 CATTTACTTATCAAAAAGACGGGAAATTCAAAGGTTATGACGTTGATGTTGTGAAAGCT
SEQ2411 CATTTACTTATCAAAAAGACGGGAAATTCAAAGGTTATGACGTTGATGTTGTCAAAGCT

SEQ2401 GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA
SEQ2402 GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA
SEQ2403 GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA
SEQ2404 GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA
SEQ2405 GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA
SEQ2406 GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA
SEQ2407 GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA
SEQ2408 GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA
SEQ2409 GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA
SEQ2410 GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA
SEQ2411 GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA

SEQ2401 ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATACAATAAAGAA
SEQ2402 ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATACAATAAAGAA
SEQ2403 ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATACAATAAAGAA
SEQ2404 ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATACAATAAAGAA
SEQ2405 ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATACAATAAAGAA
SEQ2406 ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATACAATAAAGAA
SEQ2407 ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATATAATAAAGAA
SEQ2408 ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATACAATAAAGAA
SEQ2409 ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATACAATAAAGAA
SEQ2410 ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATATAATAAAGAA
SEQ2411 ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATATAATAAAGAA

SEQ2401 AGAGCAGAAAAATATCTCTTCTCAGACCCTATATCCCGTTCAAATTATGCCGTAGTAGGG
SEQ2402 AGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAATTATGCCGTAGTAGGG
SEQ2403 AGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAATTATGCCGTAGTAGGG
SEQ2404 AGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAATTATGCCGTAGTAGGG
SEQ2405 AGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAATTATGCCGTAGTAGGG
SEQ2406 AGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAATTATGCCGTAGTAGGG
SEQ2407 AGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAATTATGCCGTAGTAGGG
SEQ2408 AGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAATTATGCCGTAGTAGGG
SEQ2409 AGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAATTATGCCGTAGTAGGG
SEQ2410 AGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAATTATGCCGTAGTAGGG
SEQ2411 AGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAATTATGCCGTAGTAGGG

SEQ2401 AAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACAGAAGTTTTA
SEQ2402 AAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAGTTTTA
SEQ2403 AAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAGTTTTA
SEQ2404 AAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAGTTTTA
SEQ2405 AAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAGTTTTA
SEQ2406 AAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAGTTTTA
SEQ2407 AAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACAGAAGTTTTA
SEQ2408 AAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAGTTTTA
SEQ2409 AAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAGTTTTA
SEQ2410 AAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACAGAAGTTTTA
SEQ2411 AAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACAGAAGTTTTA

SEQ2401 TCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAA
SEQ2402 TCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAA
SEQ2403 TCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAA
SEQ2404 TCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAA
SEQ2405 TCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATNNTAATAAAAAA
SEQ2406 TCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAA
SEQ2407 TCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAA
SEQ2408 TCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAA
SEQ2409 TCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAA
SEQ2410 TCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAA
SEQ2411 TCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAA

SEQ2401 CCAATAAAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATT
SEQ2402 CCAATAAAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATT
SEQ2403 CCAATAAAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATT
SEQ2404 CCAATAAAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATT
SEQ2405 CCAATAAAAATNAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATT
SEQ2406 CCAATAAAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATT
SEQ2407 CCAATAAAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATT
SEQ2408 CCAATAAAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATT
SEQ2409 CCAATAAAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATT
SEQ2410 CCAATAAAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATT
SEQ2411 CCAATAAAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATT

SEQ2401 GAGAGTGGGAAAATTGACTTTATCCTATATGATGCCATTTCATCTGACTATATTGTAAAA
SEQ2402 GAGAGTGGGAAAATTGACTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAA
SEQ2403 GAGAGTGGGAAAATTGACTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAA
SEQ2404 GAGAGTGGGAAAATTGACTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAA
SEQ2405 GAGAGTGGGAAAATTGACTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAA
SEQ2406 GAGAGTGGGAAAATTGACTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAA
SEQ2407 GAGAGTGGAAAAATTGACTTTATCCTATATGATGCCATTTCATCTGACTATATTGTAAAA
SEQ2408 GAGAGTGGGAAAATTGACTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAA
SEQ2409 GAGAGTGGGAAAATTGACTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAA
SEQ2410 GAGAGTGGAAAAATTGACTTTATCCTATATGATGCCATTTCATCTGACTATATTGTAAAA
SEQ2411 GAGAGTGGAAAAATTGACTTTATCCTATATGATGCCATTTCATCTGACTATATTGTAAAA

SEQ2401 GATCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT
SEQ2402 GACCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT
SEQ2403 GACCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT
SEQ2404 GACCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT
SEQ2405 GACCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT
SEQ2406 GACCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT
SEQ2407 GATCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT
SEQ2408 GACCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT
SEQ2409 GACCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT
SEQ2410 GATCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT
SEQ2411 GATCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT

SEQ2401 GGATTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA
SEQ2402 GGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA
SEQ2403 GGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAG
SEQ2404 GGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA
SEQ2405 GGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA
SEQ2406 GGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA
SEQ2407 GGATTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA
SEQ2408 GGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA
SEQ2409 GGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA
SEQ2410 GGATTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA
SEQ2411 GGATTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA

SEQ2401 ATAAGCGTATTAAAGTTTTGAAAGAAGATGGTACTTTGGCACGTTTAAGTAAACAATAT
SEQ2402 ATAAGCGTATTAAAGTTTTGAAAGAAAATGGTACTTTGGCACGTTTAAGTAAACAATAT
SEQ2403
SEQ2404 ATAAGCGTATTAAAGTTTTGAAAGAAAATGGTACTTTGGCACGTTTAAGTAAACAATAT
SEQ2405 ATAAGCGT
SEQ2406 ATAAGCGTATTAAAGTTTTGAAAGAAAATGGTACTTTGGCACGTTTAAGTAAACAATAT
SEQ2407 ATAAGCGTATTAAAGTTTTGAAAGAAGATGGTACTTTGGCACGTTTAAGTAAACAATAT
SEQ2408 ATAAGCGTATTAAAGTTTTGAAAGAAAATGGTACTTTGGCACGTTTAAGTAAACAATAT
SEQ2409 ATAAGCGTAATAAAGTTTTGAAAGAAAATGGTA
SEQ2410 ATAAGCGTATTAAAGTTTTGAAAGAAGATGGTACTTTGGCACGTTTAAGTAAACAATAT
SEQ2411 ATAAGCGTATTAAAGTTTTGAAAGAAGATGGTACTTTGGCACGTTTAAGTAAACAATAT

SEQ2401 TCGGTGGAGATTACGTTTCAAACATTGATAAA
SEQ2402 TCGGTGGAGATTACGTTTCAAACATTGATAAA
SEQ2403
SEQ2404 TCGGTGGAGATTACGTTTCAAACATTGATAAA
SEQ2405
SEQ2406 TCGGTGGAGATTACGTTTCAAACATTGATAAA
SEQ2407 TCGGTGGAGATTACGTTTCAAACATTGATAAA
SEQ2408 TCGGTGGAGATTACGTTTCAAACATTGATAAA
SEQ2409
SEQ2410 TCGGTGGAGATTACGTTTCAAACATTGATAAA
SEQ2411 TCGGTGGAGATTACGTTTCAAACATTGATAAA

>SEQ ID NO 2450: 8_1169NT frame: 1 

VSVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKEDGTLARLSKQY
FGGDYVSNIDK 

>SEQ ID NO 2451:8_18RS21 frame: 1 

VSVQASEKVELKVATDSDTAPFTYXKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKENGTLARLSKQY
FGGDYVSNIDK 

>SEQ ID NO 2452:8_2603 frame: 2

FKGYDVDVVKAVFKGSKYKVTFKTVPFDTISTGIDAGKFDLSANDFSYNKERAEKYLFSD
PISRSNYAVVGKKGSHYKSLSDLSGKSTEVLSGVNYAQVLENWNKNHPNKKPIKIKYVSG
TTGVTSRLKNIESGKIDFILYDAISSDYIVKDQSLNLSVSPLKGKIGNNKDGLEYLLLPK
DKK 

>SEQ ID NO 2453:8_090 frame: 1 

VSVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKENGTLARLSKQY
FGGDYVSNIDK 

>SEQ ID NO 2454:8_A909 frame: 1 

VSVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
SGVNYAQVLENWNKNHXNKKPXKXKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKR 

>SEQ ID NO 2455: 8_CJB110 frame: 1 

VSVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKENGTLARLSKQY 
FGGDYVSNIDK 

>SEQ ID NO 2456: 8_COH1 frame: 1

VSVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKEDGTLARLSKQY
FGGDYVSNIDK 

>SEQ ID NO 2457:8_H36B frame: 1 

VSVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKENGTLARLSKQY
FGGDYVSNIDK

>SEQ ID NO 2458: 8_JM9130013 frame: 1

VSVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRNKVLKENG

>SEQ ID NO 2459:8_M732 frame: 1 

VSVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKEDGTLARLSKQY
FGGDYVSNIDK 

>SEQ ID NO 2460:8_M781 frame: 1 

VSVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKEDGTLARLSKQY
FGGDYVSNIDK

SVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
SVQASEKVELKVATDSDTAPFTYXKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
                            FKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
SVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
SVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
SVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
SVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
SVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
SVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
SVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS
SVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS

TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL

SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
SGVNYAQVLENWNKNHXNKKPXKXKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK

DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFIN
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKENGTLARLSKQY
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKK
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKENGTLARLSKQY
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKR
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKENGTLARLSKQY
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKEDGTLARLSKQY
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKENGTLARLSKQY
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRNKVLKENG-
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKEDGTLARLSKQY
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKEDGTLARLSKQY

Table 25: Comparative Sequences relating to SAG0368
(protein of unknown function) 

SEQ ID NO. 2501: SAG0368 FROM THE 090 GBS TYPE Ia STRAIN

TATAATTTTTCGACTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCTATTGAA
GAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCATCGAAAATCTAAGTGGTCAGGAAAT 
AGCGATTCTATGATCTTAGTCACTATAAATCCTAAAACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTG
ATTAAATTGAGTGGTCCCAAAAATAATGGACAGACTGGAGTAGAAGCAAAGCTAAATGCAGCCTATGCTTCTGGTGGT 
GCGGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGA
TTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAAT 
GAACCAGAGTACAAGGCTGTTGTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATG 
CGCTATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAAAGTCCTTAAAAAAATA 
TTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGCAGTAAGTAATAACATGCAAACTAATATTGAGATA 
TCATCAAAAACGATTCCTAATTTGTTAGCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAA 
GACGCTACTTTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTAGTTGCAGTTCAAAATAGAATTAAG
AAAGAACTGGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCTATATGAAGATTACTATGGTACTACTGCT 
AGTAATGATTCTTCTACTTATTCATCAACACAAGAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGT
TACAGTGGTAATACTACTTATAGTTCTGAGACTAATCAAACAACTCATCAAAATTACTATAATAGTAGCACTCCTGCT
AGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCAATAATCATAACGGGGCTGCAACG 
CCTAATCCA 

SEQ ID NO. 2502: SAG0368 FROM THE 1169NT1 GBS TYPE V STRAIN 

TATAATTTTTCGACTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCTATTGAA 
GAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCATCGAAAATCTAAGTTGGTCAGGAAA
TAGCGATTCTATGATCTTAGTCACTATAAATCCTAAAACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATT
GATTAAATTGAGTGGTCCCAAAAATAATGGACAGACTGGCGTAGAAGCAAAGCTAAATGCAGCCTATGCTTCTGGTGG 
TGCGGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGG 
ATTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAA 
TGAACCAGAGTACAAGGCTGTTGTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTAT 
GCGCTATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAAAGTCCTTAAAAAAAT
ATTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGCAGTAAGTAATAACATGCAAACTAATATTGAGAT
ATCATCAAAAACGATTCCTAATTTGTTAGCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAAGGTGA
AGACGCTACTTTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAATAGAATTAA 
GAAAGAACTAGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCTATATGAAGATTACTATGGTACTACTGC 
TAGTAATGATTCTTCTACTTATTCATCAACACAAGAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAG 
TTACAGTGGTAATACTACTTATAGTTCTGAGACTAATCAAACAACTCATCAAAGTTACTATAATAGTAGCACTCCTGC 
TAATAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCAATAATCATAATGGGGCTGCAAC 
GCCTAATCCA 

SEQ ID NO. 2503: SAG0368 FROM THE 18RS21 GBS TYPE II STRAIN

TATAATTTTTCGACTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCTATTGAA
GAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCATCGAAAATCTAAGTGGTCAGGAAAT 
AGCGATTCTATGATCTTAGTCACTATAAATCCTAAAACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTG 
ATTAAATTGAGTGGTCCCAAAAATAATGGACAGACTGGAGTAGAAGCAAAGCTAAATGCAGCCTATGCTTCTGGTGGT 
GCGGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGA 
TTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAAT 
GAACCAGAGTACAAGGCTGTTGTTGAACCAGGGACAGATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATG
CGCTATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAAAGTCCTTAAAAAAATA
TTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGCAGTAAGTAATAACATGCAAACTAATATTGAGATA
TCATCAAAAACGATTCCTAATTTGTTAGCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAA
GACGCTACTTTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAATAGAATTAAG
AAAGAACTGGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCTATATGAAGATTACTATGGTACTACTGCT
AGTAATGATTCTTCTACTTATTCATCAACACAAGAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGT
TACAGTGGTAATACTACTTATAGTTCTGAGACTAATCAAACAACTCATCAAAATTACTATAATAGTAGCACTCCTGCT
AGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCAATAATCATAACGGGGCTGCAACG
CCTAATCCA

SEQ ID NO. 2504: SAG0368 FROM THE 2603 V/R GBS TYPE V STRAIN 

TATAATTTTTCGACTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCTATTGAA
GAAAGAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCATCGAAAATCTAAGTGGTCAGGAAAT 
AGCGATTCTATGATCTTAGTCACTATAAATCCTAAAACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTG 
ATTAAATTGAGTGGTCCCAAAAATAATGGACAGACTGGAGTAGAAGCAAAGCTAAATGCAGCCTATGCTTCTGGTGGT 
GCGGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGA 
TTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAAT 
GAACCAGAGTACAAGGCTGTTGTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATG
CGCTATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAAAGTCCTTAAAAAAATA
TTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGCAGTAAGTAATAACATGCAAACTAATATTGAGATA 
TCATCAAAAACGATTCCTAATTTGTTAGCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAA
GACGCTACTTTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAATAGAATTAAG 
AAAGAACTGGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCTATATGAAGATTACTATGGTACTACTGCT
AGTAATGATTCTTCTACTTATTCATCAACACAAGAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGT
TACAGTGGTAATACTACTTATAGTTCTGAGACTAATCAAACAACTCATCAAAATTACTATAATAGTAGCACTCCTGCT
AGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCAATAATCATAACGGGGCTGCAACG 
CCTAATCCA 

SEQ ID NO. 2505: SAG0368 FROM THE A909 GBS TYPE Ia STRAIN

TATAATTTTTCGACTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCTATTGAA 
GAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCATCGAAAATCTAAGTGGTCAGGAAAT
AGCGATTCTATGATCTTAGTCACTATAAATCCTAAAACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTG 
ATTAAATTGAGTGGTCCCAAAAATAATGGACAGACTGGAGTAGAAGCAAAGCTAAATGCAGCCTATGCTTCTGGTGGT 
GCGGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGA 
TTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAAT
GAACCAGAGTACAAGGCTGTTGTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATG 
CGCTATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAAAGTCCTTAAAAAAATA
TTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGCAGTAAGTAATAACATGCAAACTAATATTGAGATA 
TCATCAAAAACGATTCCTAATTTGTTAGCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAA 
GACGCTACTTTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAATAGAATTAAG 
AAAGAACTGGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCTATATGAAGATTACTATGGTACTACTGCT
AGTAATGATTCTTCTACTTATTCATCAACACAAGAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGT
TACAGTGGTAATACTACTTATAGTTCTGAGACTAATCAAACAACTCATCAAAATTACTATAATAGTAGCACTCCTGCT
AGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCAATAATCATAACGGGGCTGCAACG
CCTAATCCA 

SEQ ID NO. 2506: SAG0368 FROM THE CJB110 GBS NONTYPEABLE STRAIN (REVERSE 
COMPLEMENT) 

TATAATTTTTCGACTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCTATTGAA 
GAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCATCGAAAATCTAAGTGGTCAGGAAAT
AGCGATTCTATGATCTTAGTCACTATAAATCCTAAAACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTG 
ATTAAATTGAGTGGTCCCAAAAATAATGGACAGACTGGAGTAGAAGCAAAGCTAAATGCAGCCTATGCTTCTGGTGGT 
GCGGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGA
TTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAAT 
GAACCAGAGTACAAGGCTGTTGTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATG
CGCTATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAAAGTCCTTAAAAAAATA
TTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGCAGTAAGTAATAACATGCAAACTAATATTGAGATA
TCATCAAAAACGATTCCTAATTTGTTAGCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAA 
GACGCTACTTTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAATAGAATTAAG 
AAAGAACTGGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCTATATGAAGATTACTATGGTACTACTGCT
AGTAATGATTCTTCTACTTATTCATCAACACAAGAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGT
TACAGTGGTAATACTACTTATTAGTTCTGAGACTAATCAAACAACTCATCAAAATTACTATAATAGTAGCACTCCTGC 
TAGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCAATAATCATAACGGGGCTGCAAC 
GCCTAATCCA

SEQ ID NO. 2507: SAG0368 FROM THE COH1 GBS TYPE III STRAIN (REVERSE
COMPLEMENT) 

GATTTTAAGCTAGATAAATCAAAAAGTC^TGCTATTGAAGAAACAAAGCCGTTTTC^ 

ACAGGTTCAGAGCATCGAAAATCTAAGTGGTCAGGAAATAGCGATTCTATGATCTTAGTCACTATAAATCCTAAAACT 
AATAAAACAACGATGACAAGCTTAGAACGTGACGTATTGATTAAATTGAGTGGTCCCAAAAATAATGGACAGACTGGC 
GTAGAAGCAAAGCTAAATGCAGCCTATGCTTCTGGTGGTGCGGAAATGGCATTGATGACTGTTCAAGACTTATTAGAT 
ATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGATTAGTTGATTTGGTCAATGCTGTTGGTGGTATAACAGTA 
ACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAATGAACCAGAGTACAAGGCTGTTGTTGAACCAGGGACACAT 
AAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATGCGCTATGATGATCCAGAGGGAGATTATGGGCGTCAAAAA 
AGACAACGTGAAGTAATTCAAAAAGTCCTTAAAAAAATATTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTT
TCCGCAGTAAGTAATAACATGCAAACTAATATTGAGATATCATCAAAAACGATTCCTAATTTGTTAGCTTATAAAGAT 
TCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAAGACGCTACTCTATCAGATGGTGGCTCTTATCAAATTTTA 
ACTAAGAAACATCTACTTGCAGTTCAAAATAGAATTAAGAAAGAGCTGGATAAAAAGCGTAGTAAAACTCTGAAGACA 
AGCGCGATTCTATATGAAGATTACTATGGTACTACTGCTAGTAATGATTCTTCTACTTATTCATCAACACAAGAGAAT 
TATTATTATACAACACCCTTATTCAGAAGCACCACCAAGTTACAGTGGTAATACTACTTATAGTTCTGAGACTAATCA 
AACAACTCATCAAAGTTACTATAATAGTAGCACTCCTGCTAGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGA
TTCAAGTGGAAGTGTTAATAATTATAACGGGGCTGCAACGCCTAATCCAAACACAGGAACGCAACCAGTACCAGGTCA
AACTAATCCA 

SEQ ID NO. 2508: SAG0368 FROM THE H36b GBS TYPE Ib STRAIN

TATAATTTTTCGACTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCTATTGAA 
GAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCATCGAAAATCTAAGTGGTCAGGAAAT
AGCGATTCTATGATCTTAGTCACTATAAATCCTAAAACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTG
ATTAAATTGAGTGGTCCCAAAAATAATGGACAGACTGGAGTAGAAGCAAAGCTAAATGCAGCCTATGCTTCTGGTGGT
GCGGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGA
TTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAAT
GAACCAGAGTACAAGGCTGTTGTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATG
CGCTATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAAAGTCCTTAAAAAAATA 
TTGGCGTTAAATAGTA 

SEQ ID NO. 2509: SAG0368 FROM THE 

TTAGTTCATACAAAAAAATTCTTTCCGCAGTAAGTAATAACATGCAAACTAATATTGAGATATCATCAAAAACGATTC
CTAATTTGTTAGCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAAGACGCTACTTTATCAG
ATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAATAGAATTAAGAAAGAACTGGATAAAA 
AGCGTAGTAAAACTCTGAAGACAAGCGCGATTCTATATGAAGATTACTATGGTACTACTGCTAGTAATGATTCTTCTA 
CTTATTCATCAACACAAGAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGTTACAGTGGTAATACTA 
CTTATAGTTCTGAGACTAATCAAACAACTCATCAAAATTACTATAATAGTAGCACTCCTGCTAGTAACTATAGCAGTA 
ACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCAATAATCATAACGGGGCTGCAACGCCTAATCCA 

SEQ ID NO. 2510: SAG0368 FROM THE JM9130013 GBS TYPE VIII STRAIN (REVERSE 
COMPLEMENT) 

TATAATTTTTCGACTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCTATTGAA 
GAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCATCGAAAATCTAAGTGGTCAGGAAAT 
AGCGATTCTATGATCTTAGTCACTATAAATCCTAAAACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTG 
ATTAAATTGAGTGGTCCCAAAAATAATGGACAGACTGGAGTAGAAGCAAAGCTAAATGCAGCCTATGCTTCTGGTGGT
GCGGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGA 
TTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAAT 
GAACCAGAGTACAAGGCTGTTGTTGAACCAGGGACACATAAAATAAATGGAGAACAAgCACTTGTTTATTCTCGTATG 
CGCTATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAAAGTCCTTAAAAAAATA 
TTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGCAGTAAGTAATAACATGCAAACTAATATTGAGATA 
TCATCAAAAACGATTCCTAATTTGTTAGCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAA 
GACGCTACTTTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAATAGAATTAAG
AAAGAACTGGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCTATATGAAGATTACTATGGTACTACTGCT 
AGTAATGATTCTTCTACTTATTCATCAACACAAGAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGT
TACAGTGGTAATACTACTTATAGTTCTGAGACTAATCAAACAACTCATCAAAATTACTATAATAGTAGCACTCCTGCT
AGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCAATAATCATAACGGGGCTGCAACG
CCTAATCCA

SEQ ID NO. 2511: SAG0368 FROM THE M781 GBS TYPE III STRAIN (REVERSE
COMPLEMENT) 

TTCAATACTATTAATGGGTGTGGACACAGGTTCAGAGCATCGAAAATCTAAGTGGTCAGGAAATAGCGATTCTATGAT
CTTAGTCACTATAAATCCTAAAACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTGATTAAATTGAGTGG
TCCCAAAAATAATGGACAGACTGGCGTAGAAGCAAAGCTAAATGCAGCCTATGCTTCTGGTGGTGCGGAAATGGCATT 
GATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGATTAGTTGATTTGGT 
CAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAATGAACCAGAGTACAA 
GGCTGTTGTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATGCGCTATGATGATCC 
AGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAAAGTCCTTAAAAAAATATTGGCGTTAAATAG 
TATTAGTTCATACAAAAAAATTCTTTCCGCAGTAAGTAATAACATGCAAACTAATATTGAGATATCATCAAAAACGAT 
TCCTAATTTGTTAGCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAAGACGCTACTCTATC 
AGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAATAGAATTAAGAAAGAGCTGGATAA
AAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCTATATGAAGATTACTATGGTACTACTGCTAGTAATGATTCTTC
TACTTATTCATCAACACAAGAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGTTACAGTGGTAATAC 
TACTTATAGTTCTGAGACTAATCAAACAACTCATCAAAGTTACTATAATAGTAGCACTCCTGCTAGTAACTATAGCAG
TAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTTAATAATTATAACGGGGCTGCAACGCCTAATCCAAACAC 
AGGAACGCAACCAGTACCAGGTCAAACTAATCCA



SEQ2501
SEQ2502
SEQ2503
SEQ2504
SEQ2505
SEQ2506
SEQ2507 ATTTTAAGCTAGATAAATCAAAAAGTCATGCTATTGAAGAAACAAAGCCGTTTTCAATA
SEQ2508
SEQ2509
SEQ2510
SEQ2511 -TTCAATA



SEQ2501
SEQ2502
SEQ2503
SEQ2504
SEQ2505
SEQ2506
SEQ2507 TATTAATGGGTGTGGACACAGGTTCAGAGCATCGAAAATCTAAGTGGTCAGGAAATAGC
SEQ2508
SEQ2509
SEQ2510
SEQ2511 TATTAATGGGTGTGGACACAGGTTCAGAGCATCGAAAATCTAAGTGGTCAGGAAATAGC



SEQ2501
SEQ2502
SEQ2503
SEQ2504
SEQ2505
SEQ2506
SEQ2507 ATTCTATGATCTTAGTCACTATAAATCCTAAAACTAATAAAACAACGATGACAAGCTTA
SEQ2508
SEQ2509
SEQ2510
SEQ2511 ATTCTATGATCTTAGTCACTATAAATCCTAAAACTAATAAAACAACGATGACAAGCTTA



SEQ2501
SEQ2502
SEQ2503
SEQ2504
SEQ2505
SEQ2506
SEQ2507 AACGTGACGTATTGATTAAATTGAGTGGTCCCAAAAATAATGGACAGACTGGCGTAGAA
SEQ2508
SEQ2509
SEQ2510
SEQ2511 AACGTGACGTATTGATTAAATTGAGTGGTCCCAAAAATAATGGACAGACTGGCGTAGAA



SEQ2501
SEQ2502
SEQ2503
SEQ2504
SEQ2505
SEQ2506
SEQ2507 CAAAGCTAAATGCAGCCTATGCTTCTGGTGGTGCGGAAATGGCATTGATGACTGTTCAA
SEQ2508
SEQ2509
SEQ2510
SEQ2511 CAAAGCTAAATGCAGCCTATGCTTCTGGTGGTGCGGAAATGGCATTGATGACTGTTCAA



SEQ2501
SEQ2502
SEQ2503
SEQ2504
SEQ2505
SEQ2506
SEQ2507 ACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGATTAGTTGAT
SEQ2508
SEQ2509
SEQ2510
SEQ2511 ACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGATTAGTTGAT



SEQ2501
SEQ2502
SEQ2503
SEQ2504
SEQ2505
SEQ2506
SEQ2507 TGGTCAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATT
SEQ2508
SEQ2509
SEQ2510
SEQ2511 TGGTCAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATT



SEQ2501
SEQ2502
SEQ2503
SEQ2504
SEQ2505
SEQ2506
SEQ2507 CTGCCAATGAACCAGAGTACAAGGCTGTTGTTGAACCAGGGACACATAAAATAAATGGA
SEQ2508
SEQ2509
SEQ2510
SEQ2511 CTGCCAATGAACCAGAGTACAAGGCTGTTGTTGAACCAGGGACACATAAAATAAATGGA

SEQ2501
SEQ2502
SEQ2503
SEQ2504
SEQ2505
SEQ2506
SEQ2507 AACAAGCACTTGTTTATTCTCGTATGCGCTATGATGATCCAGAGGGAGATTATGGGCGT
SEQ2508
SEQ2509
SEQ2510
SEQ2511 AACAAGCACTTGTTTATTCTCGTATGCGCTATGATGATCCAGAGGGAGATTATGGGCGT

SEQ2501 TATAATTTTTCG
SEQ2502 TATAATTTTTCG
SEQ2503 TATAATTTTTCG
SEQ2504 TATAATTTTTCG
SEQ2505 TATAATTTTTCG
SEQ2506 TATAATTTTTCG
SEQ2507 AAAAAAGACAACGTGAAGTAATTCAAAAAGTCCTTAAAAAAATATTGGCGTTAAATAGT
SEQ2508 TATAATTTTTCG
SEQ2509
SEQ2510 TATAATTTTTCG
SEQ2511 AAAAAAGACAACGTGAAGTAATTCAAAAAGTCCTTAAAAAAATATTGGCGTTAAATAGT

SEQ2501 CTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCT
SEQ2502 CTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCT
SEQ2503 CTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCT
SEQ2504 CTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCT
SEQ2505 CTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCT
SEQ2506 CTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCT
SEQ2507 TTAGTTCAT-ACAAAAAAATTCTTTCCGCAGTAAGTAA--TAACATGCAAACTAATATT
SEQ2508 CTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCT
SEQ2509 TTAGTTCAT-ACAAAAAAATTCTTTCCGCAGTAAGTAA--TAACATGCAAACTAATATT
SEQ2510 CTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCT
SEQ2511 TTAGTTCAT-ACAAAAAAATTCTTTCCGCAGTAAGTAA--TAACATGCAAACTAATATT

SEQ2501 TTGAAGAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCAT
SEQ2502 TTGAAGAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCAT
SEQ2503 TTGAAGAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCAT
SEQ2504 TTGAAGAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCAT
SEQ2505 TTGAAGAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCAT
SEQ2506 TTGAAGAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCAT
SEQ2507 AGATATCATCAAAAACGATTCCTAATTTGTTAGCTTATAAAGATTCA TTGGAACAT
SEQ2508 TTGAAGAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCAT
SEQ2509 AGATATCATCAAAAACGATTCCTAATTTGTTAGCTTATAAAGATTCA TTGGAACAT
SEQ2510 TTGAAGAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCAT
SEQ2511 AGATATCATCAAAAACGATTCCTAATTTGTTAGCTTATAAAGATTCA TTGGAACAT

SEQ2501 GAAAATCTAAGT-GGTCAGGAAATAGCGATTCTATGATCTTAGTCACTATAAATCCTAA
SEQ2502 GAAAATCTAAGTTGGTCAGGAAATAGCGATTCTATGATCTTAGTCACTATAAATCCTAA
SEQ2503 GAAAATCTAAGT-GGTCAGGAAATAGCGATTCTATGATCTTAGTCACTATAAATCCTAA
SEQ2504 GAAAATCTAAGT-GGTCAGGAAATAGCGATTCTATGATCTTAGTCACTATAAATCCTAA
SEQ2505 GAAAATCTAAGT-GGTCAGGAAATAGCGATTCTATGATCTTAGTCACTATAAATCCTAA
SEQ2506 GAAAATCTAAGT-GGTCAGGAAATAGCGATTCTATGATCTTAGTCACTATAAATCCTAA
SEQ2507 TTAAATCTTATC-AGTTGAAGGGTGAAGACGCTACTCTATCAG--ATGGTGGCTCTTAT
SEQ2508 GAAAATCTAAGT-GGTCAGGAAATAGCGATTCTATGATCTTAGTCACTATAAATCCTAA
SEQ2509 TTAAATCTTATC-AGTTGAAGGGTGAAGACGCTACTTTATCAG--ATGGTGGCTCTTAT
SEQ2510 GAAAATCTAAGT-GGTCAGGAAATAGCGATTCTATGATCTTAGTCACTATAAATCCTAA
SEQ2511 TTAAATCTTATC-AGTTGAAGGGTGAAGACGCTACTCTATCAG--ATGGTGGCTCTTAT

SEQ2501 ACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTGATTAAATTGAGTGGTCC
SEQ2502 ACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTGATTAAATTGAGTGGTCC
SEQ2503 ACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTGATTAAATTGAGTGGTCC
SEQ2504 ACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTGATTAAATTGAGTGGTCC
SEQ2505 ACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTGATTAAATTGAGTGGTCC
SEQ2506 ACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTGATTAAATTGAGTGGTCC
SEQ2507 AAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAATAGAATTAAGAAAGAGCTGGAT
SEQ2508 ACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTGATTAAATTGAGTGGTCC
SEQ2509 AAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAATAGAATTAAGAAAGAACTGGAT
SEQ2510 ACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTGATTAAATTGAGTGGTCC
SEQ2511 AAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAATAGAATTAAGAAAGAGCTGGAT

SEQ2501 AAAAATAATGGACAGACTGGAGTAGAAGCAAAG--CTAAATGCAGCCTATGCTTCTGGT
SEQ2502 AAAAATAATGGACAGACTGGCGTAGAAGCAAAG--CTAAATGCAGCCTATGCTTCTGGT
SEQ2503 AAAAATAATGGACAGACTGGAGTAGAAGCAAAG--CTAAATGCAGCCTATGCTTCTGGT
SEQ2504 AAAAATAATGGACAGACTGGAGTAGAAGCAAAG--CTAAATGCAGCCTATGCTTCTGGT
SEQ2505 AAAAATAATGGACAGACTGGAGTAGAAGCAAAG--CTAAATGCAGCCTATGCTTCTGGT
SEQ2506 AAAAATAATGGACAGACTGGAGTAGAAGCAAAG--CTAAATGCAGCCTATGCTTCTGGT
SEQ2507 AAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCTATATGAAGATTACTATGGTACT
SEQ2508 AAAAATAATGGACAGACTGGAGTAGAAGCAAAG--CTAAATGCAGCCTATGCTTCTGGT
SEQ2509 AAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCTATATGAAGATTACTATGGTACT
SEQ2510 AAAAATAATGGACAGACTGGAGTAGAAGCAAAG--CTAAATGCAGCCTATGCTTCTGGT
SEQ2511 AAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCTATATGAAGATTACTATGGTACT

SEQ2501 GTGC-GGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTT
SEQ2502 GTGC-GGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTT
SEQ2503 GTGC-GGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTT
SEQ2504 GTGC-GGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTT
SEQ2505 GTGC-GGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTT
SEQ2506 GTGC-GGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTT
SEQ2507 CTGCTAGTAATGATTCTTCTACTTATTCATCAAC-ACAAGAGAATTATTATTAT-ACAA
SEQ2508 GTGC-GGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTT
SEQ2509 CTGCTAGTAATGATTCTTCTACTTATTCATCAAC-ACAAGAGAATAATTATAAT-ACAA
SEQ2510 GTGC-GGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTT
SEQ2511 CTGCTAGTAATGATTCTTCTACTTATTCATCAAC-ACAAGAGAATAATTATAAT-ACAA

SEQ2501 ATGCAAATTAATATGCAAGGATTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGT
SEQ2502 ATGCAAATTAATATGCAAGGATTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGT
SEQ2503 ATGCAAATTAATATGCAAGGATTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGT
SEQ2504 ATGCAAATTAATATGCAAGGATTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGT
SEQ2505 ATGCAAATTAATATGCAAGGATTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGT
SEQ2506 ATGCAAATTAATATGCAAGGATTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGT
SEQ2507 ACCCTTATTCAGAAGCACCACCAAGTTACAGTGGT-AATACTACTTATAGTT CTGA
SEQ2508 ATGCAAATTAATATGCAAGGATTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGT
SEQ2509 ACC-TTATTCAGAAGCACCACCAAGTTACAGTGGT-AATACTACTTATAGTT CTGA
SEQ2510 ATGCAAATTAATATGCAAGGATTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGT
SEQ2511 ACC-TTATTCAGAAGCACCACCAAGTTACAGTGGT-AATACTACTTATAGTT CTGA

SEQ2501 ACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAATGAACCAGAGTACAAGGCTGT
SEQ2502 ACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAATGAACCAGAGTACAAGGCTGT
SEQ2503 ACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAATGAACCAGAGTACAAGGCTGT
SEQ2504 ACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAATGAACCAGAGTACAAGGCTGT
SEQ2505 ACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAATGAACCAGAGTACAAGGCTGT
SEQ2506 ACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAATGAACCAGAGTACAAGGCTGT
SEQ2507 ACTAATCAAAC-AACTCATCAA AGTTACTAT-AATAG--TAGCACTCCTGCTAGT
SEQ2508 ACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAATGAACCAGAGTACAAGGCTGT
SEQ2509 ACTAATCAAAC-AACTCATCAA AATTACTAT-AATAG--TAGCACTCCTGCTAGT
SEQ2510 ACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAATGAACCAGAGTACAAGGCTGT
SEQ2511 ACTAATCAAAC-AACTCATCAA AGTTACTAT-AATAG--TAGCACTCCTGCTAGT

SEQ2501 GTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATGCG
SEQ2502 GTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATGCG
SEQ2503 GTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATGCG
SEQ2504 GTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATGCG
SEQ2505 GTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATGCG
SEQ2506 GTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATGCG
SEQ2507 ACTATAGCAGTAACAC-TAACACAGGTCAGGCTGATTCAAGTGGAAGTGTTAATAATTA
SEQ2508 GTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATGCG
SEQ2509 ACTATAGCAGTAACAC-TAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCAATAATCA
SEQ2510 GTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATGCG
SEQ2511 ACTATAGCAGTAACAC-TAACACAGGTCAGGCTGATTCAAGTGGAAGTGTTAATAATTA

SEQ2501 TATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAA
SEQ2502 TATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAA
SEQ2503 TATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAA
SEQ2504 TATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAA
SEQ2505 TATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAA
SEQ2506 TATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAA
SEQ2507 AACGGGGCTGCAACGCCTAATCCAAACACAGGAACGCAACCAGTACCAGGTCAAACTAA
SEQ2508 TATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAA
SEQ2509 AACGGGGCTGCAACGCCTAATCCA
SEQ2510 TATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAA
SEQ2511 AACGGGGCTGCAACGCCTAATCCAAACACAGGAACGCAACCAGTACCAGGTCAAACTAA

SEQ2501 GTCCTTAAAAAAATATTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGC
SEQ2502 GTCCTTAAAAAAATATTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGC
SEQ2503 GTCCTTAAAAAAATATTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGC
SEQ2504 GTCCTTAAAAAAATATTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGC
SEQ2505 GTCCTTAAAAAAATATTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGC
SEQ2506 GTCCTTAAAAAAATATTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGC
SEQ2507 CCA
SEQ2508 GTCCTTAAAAAAATATTGGCGTTAAATAGTA
SEQ2509
SEQ2510 GTCCTTAAAAAAATATTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGC
SEQ2511 CCA

SEQ2501 GTAAGTAATAACATGCAAACTAATATTGAGATATCATCAAAAACGATTCCTAATTTGTT
SEQ2502 GTAAGTAATAACATGCAAACTAATATTGAGATATCATCAAAAACGATTCCTAATTTGTT
SEQ2503 GTAAGTAATAACATGCAAACTAATATTGAGATATCATCAAAAACGATTCCTAATTTGTT
SEQ2504 GTAAGTAATAACATGCAAACTAATATTGAGATATCATCAAAAACGATTCCTAATTTGTT
SEQ2505 GTAAGTAATAACATGCAAACTAATATTGAGATATCATCAAAAACGATTCCTAATTTGTT
SEQ2506 GTAAGTAATAACATGCAAACTAATATTGAGATATCATCAAAAACGATTCCTAATTTGTT
SEQ2507
SEQ2508
SEQ2509
SEQ2510 GTAAGTAATAACATGCAAACTAATATTGAGATATCATCAAAAACGATTCCTAATTTGTT
SEQ2511

SEQ2501 GCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAAGACGCTAC
SEQ2502 GCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAAGGTGAAGACGCTAC
SEQ2503 GCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAAGACGCTAC
SEQ2504 GCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAAGACGCTAC
SEQ2505 GCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAAGACGCTAC
SEQ2506 GCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAAGACGCTAC
SEQ2507
SEQ2508
SEQ2509
SEQ2510 GCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAAGACGCTAC
SEQ2511

TTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAA 
TTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAA 
TTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAA 
TTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAA 
TTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAA
TTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAA 



TTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAA



AGAATTAAGAAAGAACTGGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCT
AGAATTAAGAAAGAACTAGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCT 
AGAATTAAGAAAGAACTGGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCT 
AGAATTAAGAAAGAACTGGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCT 
AGAATTAAGAAAGAACTGGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCT
AGAATTAAGAAAGAACTGGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCT



AGAATTAAGAAAGAACTGGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCT



TATGAAGATTACTATGGTACTACTGCTAGTAATGATTCTTCTACTTATTCATCAACACA
TATGAAGATTACTATGGTACTACTGCTAGTAATGATTCTTCTACTTATTCATCAACACA 
TATGAAGATTACTATGGTACTACTGCTAGTAATGATTCTTCTACTTATTCATCAACACA 
TATGAAGATTACTATGGTACTACTGCTAGTAATGATTCTTCTACTTATTCATCAACACA
TATGAAGATTACTATGGTACTACTGCTAGTAATGATTCTTCTACTTATTCATCAACACA
TATGAAGATTACTATGGTACTACTGCTAGTAATGATTCTTCTACTTATTCATCAACACA



TATGAAGATTACTATGGTACTACTGCTAGTAATGATTCTTCTACTTATTCATCAACACA 



GAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGTTACAGTGGTAATAC
GAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGTTACAGTGGTAATAC 
GAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGTTACAGTGGTAATAC
GAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGTTACAGTGGTAATAC 
GAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGTTACAGTGGTAATAC 
GAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGTTACAGTGGTAATAC 



GAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGTTACAGTGGTAATAC



ACTTAT-AGTTCTGAGACTAATCAAACAACTCATCAAAATTACTATAATAGTAGCACTC
ACTTAT-AGTTCTGAGACTAATCAAACAACTCATCAAAGTTACTATAATAGTAGCACTC
ACTTAT-AGTTCTGAGACTAATCAAACAACTCATCAAAATTACTATAATAGTAGCACTC
ACTTAT-AGTTCTGAGACTAATCAAACAACTCATCAAAATTACTATAATAGTAGCACTC
ACTTAT-AGTTCTGAGACTAATCAAACAACTCATCAAAATTACTATAATAGTAGCACTC
ACTTATTAGTTCTGAGACTAATCAAACAACTCATCAAAATTACTATAATAGTAGCACTC



ACTTAT-AGTTCTGAGACTAATCAAACAACTCATCAAAATTACTATAATAGTAGCACTC

SEQ2501 TGCTAGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCA
SEQ2502 TGCTAATAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCA
SEQ2503 TGCTAGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCA
SEQ2504 TGCTAGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCA
SEQ2505 TGCTAGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCA
SEQ2506 TGCTAGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCA
SEQ2507
SEQ2508
SEQ2509
SEQ2510 TGCTAGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCA
SEQ2511

SEQ2501 TAATCATAACGGGGCTGCAACGCCTAATCCA
SEQ2502 TAATCATAATGGGGCTGCAACGCCTAATCCA
SEQ2503 TAATCATAACGGGGCTGCAACGCCTAATCCA
SEQ2504 TAATCATAACGGGGCTGCAACGCCTAATCCA
SEQ2505 TAATCATAACGGGGCTGCAACGCCTAATCCA
SEQ2506 TAATCATAACGGGGCTGCAACGCCTAATCCA
SEQ2507
SEQ2508
SEQ2509
SEQ2510 TAATCATAACGGGGCTGCAACGCCTAATCCA
SEQ2511



>SEQ ID NO 2550: 54_090 frame: 1 

YNFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT
INPKTNKTTMTSLERDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV
DYFMQINMQGLVDLVNAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYS
RMRYDDPEGDYGRQKRQREVIQKVLKKILALNSISSYKKILSAVSNNMQTNIEISSKTIP
NLLAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS
AILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTYSSETNQTTHQNYYNS
STPASNYSSNTNTGQADSSGSVNNHNGAATPNP

>SEQ ID NO 2551: 54_1169NT frame: 1

YNFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKLVRK . RFYDLSH 
YKS . N . . NNDDKLRT . RID . IEWSQK . WTDWRRS KAKCS L CFWWCGNG IDDCSRLI RY . C 
. LLYAN . YARIS . FSQCCWWYNSN . . I . LSNINCCQ . TRVQGCC . TRDT . NKWRTS TCLF 
SYAL. . SRGRLWASKKTT . SNSKSP . KNIGVK . Y - FIQKNSFRSK . .HAN. Y.DIIKNDS 
. FVSL . RFIGTY . ILSVER . RRYFIRWWLLSNFN . ETSTCSSK . N . ERTR . KA . . NSEDK 
RDSI.RLLWYYC. . . FFYLFINTRE . I* . YNTLFRSTTKLQW . YYL . F . D . SNNSSKLL . . 
.HSC. .L.Q.H.HRSG.FKWKCQ.S.WGCNA.S 

>SEQ ID NO 2552: 54_18RS21 frame: 1

YNFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT
INPKTNKTTMTSLERDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV
DYFMQINMQGLVDLVNAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYS
RMRYDDPEGDYGRQKRQREVIQKVLKKILALNSISSYKKILSAVSNNMQTNIEISSKTIP
NLLAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS
AILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTYSSETNQTTHQNYYNS
STPASNYSSNTNTGQADSSGSVNNHNGAATPNP

>SEQ ID NO 2553: 54_2603 frame: 1

YNFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT 
INPKTNKTTMTSLERDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV
DYFMQINMQGLVDLVNAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYS
RMRYDDPEGDYGRQKRQREVIQKVLKKILALNSISSYKKILSAVSNNMQTNIEISSKTIP
NLLAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS
AILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTYSSETNQTTHQNYYNS
STPASNYSSNTNTGQADSSGSVNNHNGAATPNP

>SEQ ID NO 2554: 54_A909 frame: 1 

YNFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT
INPKTNKTTMTSLERDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV
DYFMQINMQGLVDLVNAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYS
RMRYDDPEGDYGRQKRQREVIQKVLKKILALNSISSYKKILSAVSNNMQTNIEISSKTIP
NLLAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS
AILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTYSSETNQTTHQNYYNS
STPASNYSSNTNTGQADSSGSVNNHNGAATPNP



>SEQ ID NO 2555:54_CJB110 frame: 1 

YNFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT
INPKTNKTTMTSLERDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV
DYFMQINMQGLVDLVNAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYS
RMRYDDPEGDYGRQKRQREVIQKVLKKILALNSISSYKKILSAVSNNMQTNIEISSKTIP
NLLAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS
AILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTY.F.D.SNNSSKLL..

>SEQ ID NO 2556:54_COH1 frame: 1

DFKLDKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVTINPKTNKTTMTSL 
ERDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINVDYFMQINMQGLVD
LVNAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYSRMRYDDPEGDYGR
QKRQREVIQKVLKKILALNSISSYKKILSAVSNNMQTNIEISSKTIPNLLAYKDSLEHIK
SYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTSAILYEDYYGTTAS 
NDSSTYSSTQENYYYTTPLFRSTTKLQW.YYL.F.D.SNNSSKLL...HSC..L.Q.H.H
RSG.FKWKC..L.RGCNA.SKHRNATSTRSN.S

>SEQ ID NO 2557:54_H36B frame: X 

YNFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT 
INPKTNKTTMTSLERDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV
DYFMQINMQGLVDLVNAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYS
RMRYDDPEGDYGRQKRQREVIQKVLKKILALNSISSYKKILSAVSNNMQTNIEISSKTIP
NLLAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS
AILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTYSSETNQTTHQNYYNS
STPASNYSSNTNTGQADSSGSVNNHNGAATPNP

>SEQ ID NO 2558:54_JM9130013 frame: 1 

YNFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT 
INPKTNKTTMTSLERDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV
DYFMQINMQGLVDLVNAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYS
RMRYDDPEGDYGRQKRQREVIQKVLKKILALNSISSYKKILSAVSNNMQTNIEISSKTIP
NLLAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS
AILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTYSSETNQTTHQNYYNS
STPASNYSSNTNTGQADSSGSVNNHNGAATPNP

>SEQ ID NO 2559:54_M781 frame: 2 

SILLMGVDTGSEHRKSKWSGNSDSMILVTINPKTNKTTMTSLERDVLIKLSGPKNNGQTG
VEAKLNAAYASGGAEMALMTVQDLLDINVDYFMQINMQGLVDLVNAVGGITVTNKFDFPI
SIAANEPEYKAVVEPGTHKINGEQALVYSRMRYDDPEGDYGRQKRQREVIQKVLKKILAL
NSISSYKKILSAVSNNMQTNIEISSKTIPNLLAYKDSLEHIKSYQLKGEDATLSDGGSYQ
ILTKKHLLAVQNRIKKELDKKRSKTLKTSAILYEDYYGTTASNDSSTYSSTQENNYNTTP
YSEAPPSYSGNTTYSSETNQTTHQSYYNSSTPASNYSSNTNTGQADSSGSVNNYNGAATP
NPNTGTQPVPGQTNP 



SEQ2550 NFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT
SEQ2551 NFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKLVRKRFYDLSHY
SEQ2552 NFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT
SEQ2553 NFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT
SEQ2554 NFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT
SEQ2555 NFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT
SEQ2556 DFKLDKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT
SEQ2557 NFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT
SEQ2558 NFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT
SEQ2559 SILLMGVDTGSEHRKSKWSGNSDSMILVT

SEQ2550 NPKTNKTTMTSLERDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV
SEQ2551 SNNNDDKLRTRIDIEWSQKWTDWRRSKAKCSLCFWWCGNGIDDCSRLIRYCLLY
SEQ2552 NPKTNKTTMTSLERDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV
SEQ2553 NPKTNKTTMTSLERDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV
SEQ2554 NPKTNKTTMTSLERDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV
SEQ2555 NPKTNKTTMTSLERDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV
SEQ2556 NPKTNKTTMTSLERDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV
SEQ2557 NPKTNKTTMTSLERDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV
SEQ2558 NPKTNKTTMTSLERDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV
SEQ2559 NPKTNKTTMTSLERDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV

SEQ2550 YFMQINMQGLVDLVNAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYS
SEQ2551 NYARISFSQCCWWYNS-NILSNINCCQTRVQGCCTRDTNKWRTSTCLFSY
SEQ2552 YFMQINMQGLVDLVNAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYS
SEQ2553 YFMQINMQGLVDLVNAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYS
SEQ2554 YFMQINMQGLVDLVNAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYS
SEQ2555 YFMQINMQGLVDLVNAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYS
SEQ2556 YFMQINMQGLVDLVNAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYS
SEQ2557 YFMQINMQGLVDLVNAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYS
SEQ2558 YFMQINMQGLVDLVNAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYS
SEQ2559 YFMQINMQGLVDLVNAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYS

SEQ2550 MRYDDPEGDYGRQKRQREVIQKVLKKILALNSISSYKKILSAVSNNMQTNIEISSKTIP
SEQ2551 LSRGRLWASKKTTSNSKSPKNIGVKYFIQKNSFRSKHANYDIIKNDSFVSLRFIGTYI-
SEQ2552 MRYDDPEGDYGRQKRQREVIQKVLKKILALNSISSYKKILSAVSNNMQTNIEISSKTIP
SEQ2553 MRYDDPEGDYGRQKRQREVIQKVLKKILALNSISSYKKILSAVSNNMQTNIEISSKTIP
SEQ2554 MRYDDPEGDYGRQKRQREVIQKVLKKILALNSISSYKKILSAVSNNMQTNIEISSKTIP
SEQ2555 MRYDDPEGDYGRQKRQREVIQKVLKKILALNSISSYKKILSAVSNNMQTNIEISSKTIP
SEQ2556 MRYDDPEGDYGRQKRQREVIQKVLKKILALNSISSYKKILSAVSNNMQTNIEISSKTIP
SEQ2557 MRYDDPEGDYGRQKRQREVIQKVLKKILALNSISSYKKILSAVSNNMQTNIEISSKTIP
SEQ2558 MRYDDPEGDYGRQKRQREVIQKVLKKILALNSISSYKKILSAVSNNMQTNIEISSKTIP
SEQ2559 MRYDDPEGDYGRQKRQREVIQKVLKKILALNSISSYKKILSAVSNNMQTNIEISSKTIP

SEQ2550 LLAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS
SEQ2551 L-SVERRRYFIRWWLLSNFNETSTCSSKNERTRKANSEDKRDSIRLLWYYCFFYLFINT
SEQ2552 LLAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS
SEQ2553 LLAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS
SEQ2554 LLAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS
SEQ2555 LLAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS
SEQ2556 LLAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS
SEQ2557 LLAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS
SEQ2558 LLAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS
SEQ2559 LLAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS

SEQ2550 ILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTYSSETNQTTHQNYYNS
SEQ2551 ELY NTLFRST TKLQW YYLFDSNNSSKLLHS CLQH
SEQ2552 ILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTYSSETNQTTHQNYYNS
SEQ2553 ILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTYSSETNQTTHQNYYNS
SEQ2554 ILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTYSSETNQTTHQNYYNS
SEQ2555 ILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTYFDSNNSSKLL
SEQ2556 ILYEDYYGTTASNDSSTYSSTQENYYYTTPLFRSTTKLQWYYLFDSNNSSKLLHSCLQH
SEQ2557 ILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTYSSETNQTTHQNYYNS
SEQ2558 ILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTYSSETNQTTHQNYYNS
SEQ2559 ILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTYSSETNQTTHQSYYNS

SEQ2550 TPASNYSSNTNTGQADSSGSVNNHNGAATPNP
SEQ2551 RSGFKWKCQSWGCNAS
SEQ2552 TPASNYSSNTNTGQADSSGSVNNHNGAATPNP-
SEQ2553 TPASNYSSNTNTGQADSSGSVNNHNGAATPNP
SEQ2554 TPASNYSSNTNTGQADSSGSVNNHNGAATPNP
SEQ2555
SEQ2556 RSGFKWKCLRGCNASKHRNATSTRSNS
SEQ2557 TPASNYSSNTNTGQADSSGSVNNHNGAATPNP
SEQ2558 TPASNYSSNTNTGQADSSGSVNNHNGAATPNP
SEQ2559 TPASNYSSNTNTGQADSSGSVNNYNGAATPNPNTGTQPVPGQTNP

Table 26: Comparative Sequences relating to SAG0503 (lipase/acylhydrolase)

SEQ ID NO. 2601: SAG0503 FROM THE 090 GBS TYPE Ia STRAIN
(REVERSE COMPLEMENT) 

GGGCACAAGTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAATCCTAAATTAACAAAAAA 
AGACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGCTCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATAC 
AACCTCTCAAGGTGGTTTTGTCCCACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAA
TTATGGTGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGAAAAAGATTTAGA
GAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGCTGTTATTCGTAAAGAGCTCAGTCATTTATC 
ACTAAATTCCTTTGAGAAACCAGCAGAAGCATATAAGGAACGTTTGAAAGAAATACTTGCAAAAGCAAGACAAGATAA
TCCTAAATTGCCTATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAATGCAAAC 
CGTTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAATGTTTATTTTGTCCCAATTAATGA
CCGCCTTTATAAGGGAATAAATGGTAAAGAGGGTATTACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGC 
TCTCTTTACTGGAGACCATTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAA 
TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAAG 

SEQ ID NO. 2602: SAG0503 FROM THE H36b GBS TYPE Ib STRAIN
(REVERSE COMPLEMENT) 

TTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAATCCTAAATTAACAAAAAAAGACTTCCT 
AACAAAGAAAGTTATCCCACTTAACTATGTTGCTCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAACCTCTCA 
AGGTGGTTTTGTTCCACTGCTATCAGAATCACTCCATAATCGATACTGTTACCAAGTGACTTCTGTTAATTATGGTGT 
GTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGAAAAAGATTTAGAGAAAGCTGA 
TTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGCTGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTC 
CTTTGAGAAACCAGCAGAAGCATATAAGGAACGTTTGAAAGAAATCCTTGCAAAAGCAAGACAAGATAATCCTAAATT
GCCTATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAATGCAAACCGTTATTGA 
TAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAATGTTTATTTTGTCCCAATTAATGACCGCCTTTA 
TAAGGGAATAAATGGTAAAGAGGGTATTATAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTAC 
TGGAGACCATTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAATGAAACAAG 
AAAAAACTGGCCGAACCCAGCTTTCTTGTACAAAGTGGTCC 

SEQ ID NO. 2603: SAG0503 FROM THE 18RS21 GBS TYPE II STRAIN (REVERSE 
COMPLEMENT) 

GTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAATCCTAAATTAACAAAAAAAGACTTCC 
TAACAAAGAAAGTTATCCCACTTAACTATGTTGCTCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAACCTCTC
AAGGTGGTTTTGTTCCACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGGTG 
TGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGAAAAAGATTTAGAGAAAGCTG 
ATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGCTGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATT 
CCTTTGAGAAACCAGCAGAAGCATATAAGGAACGTTTGAAAGAAATCCTTGCAAAAGCAAGACAAGATAATCCTAAAT
TGCCTATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAATGCAAACCGTTATTG 
ATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAATGTTTATTTTGTCCCAATTAATGACCGCCTTT
ATAAGGGAATAAATGGTAAAGAGGGTATTACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTA 
CTGGAGACCATTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAATGAAACAA 
GAAAAAACTGGCCGAACCCAGCTTTCTTGTACAA 

SEQ ID NO. 2604: SAG0503 FROM THE COH1 GBS TYPE III STRAIN (REVERSE
COMPLEMENT) 

GGACAAGTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAATCCTAAATTAACAAAAAAAG
ACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGCTCTTGGAGATTCTCTGACCGAAGGTGTGGGGGATACAA 
CCTCTCAAGGTGGTTTTGTCCCACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATT 
ATGGTGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGAAAAAGATTTAGAGA 
AAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGCTGTTATTCGTAAAGAGCTCAGTCATTTATCAC 
TAAATTCCTTTGAGAAACCAGCAGAAGCATATAAGGAACGTTTGAAAGAAATTCTTGCAAAAGCAAGACAAGATAATC
CTAAATTGCCTATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAATGCAAACCG 
TTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAATGTTTATTTTGTCCCAATTAATGACC 
GCCTTTATAAGGGAATAAATGGTAAAGAGGGTATTACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTC
TCTTTACTGGAGACCATTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAATG 
AAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAA 

SEQ ID NO. 2605: SAG0503 FROM THE CJB110 GBS NONTYPEABLE STRAIN (REVERSE
COMPLEMENT) 

GTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAATCCTAAATTAACAAAAAAAGACTTCC 
TAACAAAGAAAGTTATCCCACTTAACTATGTTGCTCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAACCTCTC
AAGGTGGTTTTGTCCCACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGGTG 
TGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGAAAAAGATTTAGAGAAAGCTG 
ATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGCTGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATT 
CCTTTGAGAAACCAGCAGAAGCATATAAGGAACGTTTGAAAGAAATACTTGCAAAAGCAAGACAAGATAATCCTAAAT
TGCCTATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAATGCAAACCGTTATTG 
ATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAATGTTTATTTTGTCCCAATTAATGACCGCCTTT 
ATAAGGGAATAAATGGTAAAGAGGGTATTACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTA
CTGGAGACCATTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAATGAAACAA
GAAAAAACTGGCCGAACCCAGCTTTCTTGTACAA 

SEQ ID NO. 2606: SAG0503 FROM THE 1169NT1 GBS TYPE V STRAIN (REVERSE 
COMPLEMENT) 

GTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAATCCTAAATTAACAAAAAAAGACTTCC 
TAACAAAGAAAGTTATCCCACTTAACTATGTTGCTCTTGGAGATTCTCTGACCGAAGGTGTGGGGGATACAACCTCTC
AAGGTGGTTTTGTCCCACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGGTG 
TGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGAAAAAGATTTAGAGAAAGCTG 
ATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGCTGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATT 
CCTTTGAGAAACCAGCAGAAGCATATAAGGAACGTTTGAAAGAAATTCTTGCAAAAGCAAGACAAGATAATCCTAAAT
TGCCTATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAATGCAAACCGTTATTG 
ATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAATGTTTATTTTGTCCCAATTAATGACCGCCTTT 
ATAAGGGAATAAATGGTAAAGAGGGTATTACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTA
CTGGAGACCATTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAATGAAACAA 
GAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAA 

SEQ ID NO. 2607: SAG0503 FROM THE JM9130013 GBS TYPE VIII STRAIN
(REVERSE COMPLEMENT) 

GTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAATCCTAAATTAACAAAAAAAGACTTCC 
TAACAAAGAAAGTTATCCCACTTAACTATGTTGCTCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAACCTCTC
AAGGTGGTTTTGTTCCACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGGTG 
TGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGAAAAAGATTTAGAGAAAGCTG 
ATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGCTGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATT 
CCTTTGAGAAACCAGCAGAAGCATATAAGGAACGTTTGAAAGAAATCCTTGCAAAAGCAAGACAAGATAATCCTAAAT 
TGCCTATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAATGCAAACCGTTATTG 
ATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAATGTTTATTTTGTCCCAATTAATGACCGCCTTT
ATAAGGGAATAAATGGTAAAGAGGGTATTACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTA 
CTGGAGACCATTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAATGAAACAA
GAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAA 

SEQ ID NO. 2608: SAG0503 FROM THE 2603 V/R GBS TYPE V STRAIN 
(REVERSE COMPLEMENT) 

AGTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAATCCTAAATTAACAAAAAAAGACTTC 
CTAACAAAGAAAGTTATCCCACTTAACTATGTTGCTCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAACCTCT
CAAGGTGGTTTTGTTCCACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGGT
GTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGAAAAAGATTTAGAGAAAGCT 
GATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGCTGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAAT 
TCCTTTGAGAAACCAGCAGAAGCATATAAGGAACGTTTGAAAGAAATCCTTGCAAAAGCAAGACAAGATAATCCTAAA
TTGCCTATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAATGCAAACCGTTATT 
GATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAATGTTTATTTTGTCCCAATTAATGACCGCCTT
TATAAGGGAATAAATGGTAAAGAGGGTATTACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTT
ACTGGAGACCATTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAATGAAACA
AGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAAGTGG 

SEQ ID NO. 2609: SAG0503 FROM THE M781 GBS TYPE III STRAIN 
(REVERSE COMPLEMENT) 

GGACAAGTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAATCCTAAATTAACAAAAAAAG 
ACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGCTCTTGGAGATTCTCTGACCGAAGGTGTGGGGGATACAA 
CCTCTCAAGGTGGTTTTGTCCCACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATT 
ATGGTGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGAAAAAGATTTAGAGA 
AAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGCTGTTATTCGTAAAGAGCTCAGTCATTTATCAC 
TAAATTCCTTTGAGAAACCAGCAGAAGCATATAAGGAACGTTTGAAAGAAATTCTTGCAAAAGCAAGACAAGATAATC 
CTAAATTGCCTATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAATGCAAACCG 
TTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAATGTTTATTTTGTCCCAATTAATGACC 
GCCTTTATAAGGGAATAAATGGTAAAGAGGGTATTACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTC 
TCTTTACTGGAGACCATTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAATG 
AAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAA 



SEQ2601 GGGCACAAGTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAA
SEQ2602 TTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAA
SEQ2603 GTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAA
SEQ2604 GGACAAGTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAA
SEQ2605 GTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAA
SEQ2606 GTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAA
SEQ2607 GTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAA
SEQ2608 AGTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAA
SEQ2609 GGACAAGTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAA

SEQ2601 TCCTAAATTAACAAAAAAAGACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGC
SEQ2602 TCCTAAATTAACAAAAAAAGACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGC
SEQ2603 TCCTAAATTAACAAAAAAAGACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGC
SEQ2604 TCCTAAATTAACAAAAAAAGACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGC
SEQ2605 TCCTAAATTAACAAAAAAAGACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGC
SEQ2606 TCCTAAATTAACAAAAAAAGACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGC
SEQ2607 TCCTAAATTAACAAAAAAAGACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGC
SEQ2608 TCCTAAATTAACAAAAAAAGACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGC
SEQ2609 TCCTAAATTAACAAAAAAAGACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGC

SEQ2601 TCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAACCTCTCAAGGTGGTTTTGTCCC
SEQ2602 TCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAACCTCTCAAGGTGGTTTTGTTCC
SEQ2603 TCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAACCTCTCAAGGTGGTTTTGTTCC
SEQ2604 TCTTGGAGATTCTCTGACCGAAGGTGTGGGGGATACAACCTCTCAAGGTGGTTTTGTCCC
SEQ2605 TCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAACCTCTCAAGGTGGTTTTGTCCC
SEQ2606 TCTTGGAGATTCTCTGACCGAAGGTGTGGGGGATACAACCTCTCAAGGTGGTTTTGTCCC
SEQ2607 TCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAACCTCTCAAGGTGGTTTTGTTCC
SEQ2608 TCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAACCTCTCAAGGTGGTTTTGTTCC
SEQ2609 TCTTGGAGATTCTCTGACCGAAGGTGTGGGGGATACAACCTCTCAAGGTGGTTTTGTCCC

SEQ2601 ACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGG
SEQ2602 ACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGG
SEQ2603 ACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGG
SEQ2604 ACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGG
SEQ2605 ACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGG
SEQ2606 ACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGG
SEQ2607 ACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGG
SEQ2608 ACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGG
SEQ2609 ACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGG

SEQ2601 TGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGA
SEQ2602 TGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGA
SEQ2603 TGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGA
SEQ2604 TGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGA
SEQ2605 TGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGA
SEQ2606 TGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGA
SEQ2607 TGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGA
SEQ2608 TGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGA
SEQ2609 TGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGA

SEQ2601 AAAAGATTTAGAGAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGC
SEQ2602 AAAAGATTTAGAGAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGC






WO 2004/018646 



PCT/US2003/026827 




SEQ2603 AAAAGATTTAGAGAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGC
SEQ2604 AAAAGATTTAGAGAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGC
SEQ2605 AAAAGATTTAGAGAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGC
SEQ2606 AAAAGATTTAGAGAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGC
SEQ2607 AAAAGATTTAGAGAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGC
SEQ2608 AAAAGATTTAGAGAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGC
SEQ2609 AAAAGATTTAGAGAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGC



SEQ2601 TGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTCCTTTGAGAAACCAGCAGAAGC
SEQ2602 TGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTCCTTTGAGAAACCAGCAGAAGC
SEQ2603 TGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTCCTTTGAGAAACCAGCAGAAGC
SEQ2604 TGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTCCTTTGAGAAACCAGCAGAAGC
SEQ2605 TGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTCCTTTGAGAAACCAGCAGAAGC
SEQ2606 TGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTCCTTTGAGAAACCAGCAGAAGC
SEQ2607 TGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTCCTTTGAGAAACCAGCAGAAGC
SEQ2608 TGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTCCTTTGAGAAACCAGCAGAAGC
SEQ2609 TGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTCCTTTGAGAAACCAGCAGAAGC



SEQ2601 ATATAAGGAACGTTTGAAAGAAATACTTGCAAAAGCAAGACAAGATAATCCTAAATTGCC
SEQ2602 ATATAAGGAACGTTTGAAAGAAATCCTTGCAAAAGCAAGACAAGATAATCCTAAATTGCC
SEQ2603 ATATAAGGAACGTTTGAAAGAAATCCTTGCAAAAGCAAGACAAGATAATCCTAAATTGCC
SEQ2604 ATATAAGGAACGTTTGAAAGAAATTCTTGCAAAAGCAAGACAAGATAATCCTAAATTGCC
SEQ2605 ATATAAGGAACGTTTGAAAGAAATACTTGCAAAAGCAAGACAAGATAATCCTAAATTGCC
SEQ2606 ATATAAGGAACGTTTGAAAGAAATTCTTGCAAAAGCAAGACAAGATAATCCTAAATTGCC
SEQ2607 ATATAAGGAACGTTTGAAAGAAATCCTTGCAAAAGCAAGACAAGATAATCCTAAATTGCC
SEQ2608 ATATAAGGAACGTTTGAAAGAAATCCTTGCAAAAGCAAGACAAGATAATCCTAAATTGCC
SEQ2609 ATATAAGGAACGTTTGAAAGAAATTCTTGCAAAAGCAAGACAAGATAATCCTAAATTGCC



SEQ2601 TATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAAT
SEQ2602 TATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAAT
SEQ2603 TATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAAT
SEQ2604 TATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAAT
SEQ2605 TATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAAT
SEQ2606 TATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAAT
SEQ2607 TATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAAT
SEQ2608 TATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAAT
SEQ2609 TATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAAT



SEQ2601 GCAAACCGTTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAA
SEQ2602 GCAAACCGTTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAA
SEQ2603 GCAAACCGTTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAA
SEQ2604 GCAAACCGTTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAA
SEQ2605 GCAAACCGTTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAA
SEQ2606 GCAAACCGTTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAA
SEQ2607 GCAAACCGTTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAA
SEQ2608 GCAAACCGTTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAA
SEQ2609 GCAAACCGTTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAA



SEQ2601 TGTTTATTTTGTCCCAATTAATGACCGCCTTTATAAGGGAATAAATGGTAAAGAGGGTAT
SEQ2602 TGTTTATTTTGTCCCAATTAATGACCGCCTTTATAAGGGAATAAATGGTAAAGAGGGTAT
SEQ2603 TGTTTATTTTGTCCCAATTAATGACCGCCTTTATAAGGGAATAAATGGTAAAGAGGGTAT
SEQ2604 TGTTTATTTTGTCCCAATTAATGACCGCCTTTATAAGGGAATAAATGGTAAAGAGGGTAT
SEQ2605 TGTTTATTTTGTCCCAATTAATGACCGCCTTTATAAGGGAATAAATGGTAAAGAGGGTAT
SEQ2606 TGTTTATTTTGTCCCAATTAATGACCGCCTTTATAAGGGAATAAATGGTAAAGAGGGTAT
SEQ2607 TGTTTATTTTGTCCCAATTAATGACCGCCTTTATAAGGGAATAAATGGTAAAGAGGGTAT
SEQ2608 TGTTTATTTTGTCCCAATTAATGACCGCCTTTATAAGGGAATAAATGGTAAAGAGGGTAT
SEQ2609 TGTTTATTTTGTCCCAATTAATGACCGCCTTTATAAGGGAATAAATGGTAAAGAGGGTAT



SEQ2601 TACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTACTGGAGACCA
SEQ2602 TATAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTACTGGAGACCA
SEQ2603 TACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTACTGGAGACCA
SEQ2604 TACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTACTGGAGACCA
SEQ2605 TACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTACTGGAGACCA
SEQ2606 TACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTACTGGAGACCA
SEQ2607 TACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTACTGGAGACCA
SEQ2608 TACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTACTGGAGACCA
SEQ2609 TACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTACTGGAGACCA

SEQ2601 TTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAA
SEQ2602 TTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAA

SEQ2603 TTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAA
SEQ2604 TTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAA
SEQ2605 TTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAA
SEQ2606 TTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAA
SEQ2607 TTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAA
SEQ2608 TTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAA
SEQ2609 TTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAA

SEQ2601 TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAAG
SEQ2602 TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAAGTGGTCC
SEQ2603 TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAA
SEQ2604 TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAA
SEQ2605 TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAA
SEQ2606 TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAA
SEQ2607 TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAA
SEQ2608 TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAAGTGG
SEQ2609 TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAA


>SEQ ID NO 2650:103__090 frame: 2 

IFSLIIPKSNPKLTKKDFLTKKVIPLNYVALGDSLTEGVGDTTSQGGFVP
LLSESLHNRYSYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLA
VIRKELSHLSLNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKM
QTVIDNWNKATKEVVDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDH
FHPNNIGYQIMSNAVMEKINETRKNWP

>SEQ ID NO 2651:103_H36B frame: 2

IFSLIIPKSNPKLTKKDFLTKKVIPLNYVALGDSLTEGVGDTTSQGGFVPLLS
ESLHNRYSYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIR
KELSHLSLNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQTV
IDNWNKATKEVVDASENVYFVPINDRLYKGINGKEGIIESSNSQASITNDALFTGDHFHP
NNIGYQIMSNAVMEKINETRKNWP

>SEQ ID NO 2652:103__18RS21 frame: 3

IFSLIIPKSNPKLTKKDFLTKKVIPLNYVALGDSLTEGVGDTTSQGGFVPLLS
ESLHNRYSYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIR
KELSHLSLNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQTV
IDNWNKATKEVVDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDHFHP
NNIGYQIMSNAVMEKINETRKNWP

>SEQ ID NO 2653:103_COH1 frame: 3 

IFSLIIPKSNPKLTKKDFLTKKVIPLNYVALGDSLTEGVGDTTSQGGFVPL
LSESLHNRYSYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAV
IRKELSHLSLNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQ
TVIDNWNKATKEVVDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDHF
HPNNIGYQIMSNAVMEKINETRKNWP

>SEQ ID NO 2654:103_CJB110 frame: 3 

IFSLIIPKSNPKLTKKDFLTKKVIPLNYVALGDSLTEGVGDTTSQGGFVPLLS
ESLHNRYSYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIR
KELSHLSLNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQTV
IDNWNKATKEVVDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDHFHP
NNIGYQIMSNAVMEKINETRKNWP

>SEQ ID NO 2655:103_1169NT frame: 3 

IFSLIIPKSNPKLTKKDFLTKKVIPLNYVALGDSLTEGVGDTTSQGGFVPLLS
ESLHNRYSYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIR
KELSHLSLNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQTV
IDNWNKATKEVVDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDHFHP
NNIGYQIMSNAVMEKINETRKNWP



>SEQ ID NO 2656:103_JW9130013 frame: 3

IFSLIIPKSNPKLTKKDFLTKKVIPLNYVALGDSLTEGVGDTTSQGGFVPLLS
ESLHNRYSYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIR
KELSHLSLNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQTV
IDNWNKATKEVVDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDHFHP
NNIGYQIMSNAVMEKINETRKNWP

>SEQ ID NO 2657:103_2603 frame: 1 

IFSLIIPKSNPKLTKKDFLTKKVIPLNYVALGDSLTEGVGDTTSQGGFVPLL
SESLHNRYSYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVI
RKELSHLSLNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQT
VIDNWNKATKEVVDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDHFH
PNNIGYQIMSNAVMEKINETRKNWP

>SEQ ID NO 2658:103_M781 frame: 3 

IFSLIIPKSNPKLTKKDFLTKKVIPLNYVALGDSLTEGVGDTTSQGGFVPL
LSESLHNRYSYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAV
IRKELSHLSLNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQ
TVIDNWNKATKEVVDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDHF
HPNNIGYQIMSNAVMEKINETRKNWP



SEQ2650 IFSLIIPKSNPKLTKKDFLTKKVIPLNYVALGDSLTEGVGDTTSQGGFVPLLSESLHNRY
SEQ2651 IFSLIIPKSNPKLTKKDFLTKKVIPLNYVALGDSLTEGVGDTTSQGGFVPLLSESLHNRY
SEQ2652 IFSLIIPKSNPKLTKKDFLTKKVIPLNYVALGDSLTEGVGDTTSQGGFVPLLSESLHNRY
SEQ2653 IFSLIIPKSNPKLTKKDFLTKKVIPLNYVALGDSLTEGVGDTTSQGGFVPLLSESLHNRY
SEQ2654 IFSLIIPKSNPKLTKKDFLTKKVIPLNYVALGDSLTEGVGDTTSQGGFVPLLSESLHNRY
SEQ2655 IFSLIIPKSNPKLTKKDFLTKKVIPLNYVALGDSLTEGVGDTTSQGGFVPLLSESLHNRY
SEQ2656 IFSLIIPKSNPKLTKKDFLTKKVIPLNYVALGDSLTEGVGDTTSQGGFVPLLSESLHNRY
SEQ2657 IFSLIIPKSNPKLTKKDFLTKKVIPLNYVALGDSLTEGVGDTTSQGGFVPLLSESLHNRY
SEQ2658 IFSLIIPKSNPKLTKKDFLTKKVIPLNYVALGDSLTEGVGDTTSQGGFVPLLSESLHNRY

SEQ2650 SYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIRKELSHLS
SEQ2651 SYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIRKELSHLS
SEQ2652 SYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIRKELSHLS
SEQ2653 SYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIRKELSHLS
SEQ2654 SYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIRKELSHLS
SEQ2655 SYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIRKELSHLS
SEQ2656 SYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIRKELSHLS
SEQ2657 SYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIRKELSHLS
SEQ2658 SYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIRKELSHLS

SEQ2650 LNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQTVIDNWNKA
SEQ2651 LNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQTVIDNWNKA
SEQ2652 LNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQTVIDNWNKA
SEQ2653 LNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQTVIDNWNKA
SEQ2654 LNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQTVIDNWNKA
SEQ2655 LNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQTVIDNWNKA
SEQ2656 LNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQTVIDNWNKA
SEQ2657 LNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQTVIDNWNKA
SEQ2658 LNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQTVIDNWNKA

SEQ2650 TKEVVDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDHFHPNNIGYQI
SEQ2651 TKEVVDASENVYFVPINDRLYKGINGKEGIIESSNSQASITNDALFTGDHFHPNNIGYQI
SEQ2652 TKEVVDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDHFHPNNIGYQI
SEQ2653 TKEVVDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDHFHPNNIGYQI
SEQ2654 TKEVVDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDHFHPNNIGYQI
SEQ2655 TKEVVDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDHFHPNNIGYQI
SEQ2656 TKEVVDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDHFHPNNIGYQI
SEQ2657 TKEVVDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDHFHPNNIGYQI
SEQ2658 TKEVVDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDHFHPNNIGYQI

SEQ2650 MSNAVMEKINETRKNWP
SEQ2651 MSNAVMEKINETRKNWP
SEQ2652 MSNAVMEKINETRKNWP
SEQ2653 MSNAVMEKINETRKNWP
SEQ2654 MSNAVMEKINETRKNWP
SEQ2655 MSNAVMEKINETRKNWP
SEQ2656 MSNAVMEKINETRKNWP
SEQ2657 MSNAVMEKINETRKNWP
SEQ2658 MSNAVMEKINETRKNWP






Table 27: Comparative Sequences relating to SAG1473 
(cell wall surface anchor family protein) 

SEQ ID NO. 2701: SAG1473 FROM THE 1169NT1 GBS TYPE V STRAIN 
(REVERSE COMPLEMENT) 

GATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATCAGATGA
ACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCATCAAGTGAACCAGAAACAAATC
CGTCAACTAATCCACCTACAACAGAACCATCGCAACCCTCACCTAGTGAAGAGAACAAGCCTGATGGTAGA
ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT
TAAGAATTTTAGTAAAGCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAA
GTGATGGGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA

SEQ ID NO. 2702: SAG1473 FROM THE 18RS21 GBS TYPE II STRAIN 

GATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATCAGATGA 

ACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCATCAAGTGAACCAGAAACAAATC
CGTCAACTAATCCACCTACAACAGAACCATCGCAACCCTCACCTAGTGAAGAGAACAAGCCTGATGGTAGA
ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT
TAAGAATTTTAGTAAAGCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAA

ATGATGGGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

SEQ ID NO. 2703: SAG1473 FROM THE 2603 V/R GBS TYPE V STRAIN 

GATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATCAGATGA 
ACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCATCAAGTGAACCAGAAACAAATC 
CGTCAACTAATCCACCTACAACAGAACCATCGCAACCCTCACCTAGTGAAGAGAACAAGCCTGATGGTAGA 
ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT 
TAAGAATTTTAGTAAAGCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAA 
ATGATGGGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

SEQ ID NO. 2704: SAG1473 FROM THE 090 GBS TYPE Ia STRAIN

GACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCATCAAGTGAACCAGAAACAAATCCGTC
AACTAATCCACCTACAACAGAACCATCGCAACCCTCACCTAGTGAAGAGAACAAGCCTGATGGTAGAACGA

AGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAG 
AATTTTAGTAAAGCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAAATGA 
TGGGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

SEQ ID NO. 2705: SAG1473 FROM THE A909 GBS TYPE Ia STRAIN

GATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATTAGATGA
ACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCATCAAGTGAACCAGAAACAAATC
CCTCAACTAATCCACCTACAACAGAACCATCGCAACCCTCACCTAGTGAAGAGAACAAGCCTGATGGTAGC
ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT
TAAGAATTTTAGTAAAGCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAA

ATGATGAGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

SEQ ID NO. 2706: SAG1473 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

GATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATCAGATGA 

ACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCATCAAGTGAACCAGAAACAAATC 

CGTCAACTAATCCACCTACAACAGAACCATCGCAACCCTCACCTAGTGAAGAGAACAAGCCTGATGGTAGA 

ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT 

TAAGAATTTTAGTAAAGCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAA 

ATGATGGGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

SEQ ID NO. 2707: SAG1473 FROM THE COH1 GBS TYPE III STRAIN
(REVERSE COMPLEMENT) 

GATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATCAGATGA 
ACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCAAGTTCATCAAGTGAACCAGAAACAAATC
CCTCAACTAATCCACCTACAACAGAACCATCGCAACCCTCACCTAGTGAAGAGAACAAGCCTGATGGGAGC
ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT
TAAGAATTTTAGTAAAGCAAGTAGTGATCAAGAAGAAGTGGAACGCGATGAATCATCATCTTCAAAAGCAA
ATGATGAGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

SEQ ID NO. 2708: SAG1473 FROM THE H36b GBS TYPE Ib STRAIN

GATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATTAGATGA 
ACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCATCAAGTGAACCAGAAACAAATC 
CCTCAACTAATCCACCTACAAGAGAACCATCGCAA 

ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT 
TAAGAATTTTAGTAAAGCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAA
ATGATGAGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

SEQ ID NO. 2709: SAG1473 FROM THE JM910013 GBS TYPE VIII STRAIN 

GATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATTAGATGA 

ACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCATCAAGTGAACCAGAAACAAATC

CCTCAACTAATCCACCTACAACAGAACCATCGCAACCCTCACCTAGTGAAGAGAACAAGCCTGATGGTAGC 

ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT 

TAAGAATTTTAGTAAAGCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAA 

ATGATGAGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

SEQ ID NO. 2710: SAG1473 FROM THE M732 GBS TYPE III STRAIN 

GATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATCAGATGA 
ACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCAAGTTCATCAAGTGAACCAGAAACAAATC 
CCTCAACTAATCCACCTACAACAGAACCATCGCAACCCTCACCTAGTGAAGAGAACAAGCCTGATGGGAGC 
ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT 
TAAGAATTTTAGTAAAGCAAGTAGTGATCAAGAAGAAGTGGAACGCGATGAATCATCATCTTCAAAAGCAA 
ATGATGAGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

SEQ ID NO. 2711: SAG1473 FROM THE M781 GBS TYPE III STRAIN 

GATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATCAGATGA 
ACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCAAGTTCATCAAGTGAACCAGAAACAAATC 
CCTCAACTAATCCACCTACAACAGAACCATCGCAACCCTCACCTAGTGAAGAGAACAAGCCTGATGGGAGC 
ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT 
TAAGAATTTTAGTAAAGCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAA
ATGATGAGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 



SEQ2701 ATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAA
SEQ2702 ATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAA
SEQ2703 ATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAA
SEQ2704
SEQ2705 ATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAA
SEQ2706 ATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAA
SEQ2707 ATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAA
SEQ2709 ATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAA
SEQ2710 ATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAA
SEQ2711 ATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAA



SEQ2701 GATCAGATGAACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCA
SEQ2702 GATCAGATGAACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCA
SEQ2703 GATCAGATGAACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCA
SEQ2704 GACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCA
SEQ2705 GATTAGATGAACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCA
SEQ2706 GATCAGATGAACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCA
SEQ2707 GATCAGATGAACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCAAGTTCA
SEQ2709 GATTAGATGAACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCA
SEQ2710 GATCAGATGAACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCAAGTTCA
SEQ2711 GATCAGATGAACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCAAGTTCA



SEQ2701 TCAAGTGAACCAGAAACAAATCCGTCAACTAATCCACCTACAACAGAACCATCGCAACCC
SEQ2702 TCAAGTGAACCAGAAACAAATCCGTCAACTAATCCACCTACAACAGAACCATCGCAACCC
SEQ2703 TCAAGTGAACCAGAAACAAATCCGTCAACTAATCCACCTACAACAGAACCATCGCAACCC
SEQ2704 TCAAGTGAACCAGAAACAAATCCGTCAACTAATCCACCTACAACAGAACCATCGCAACCC
SEQ2705 TCAAGTGAACCAGAAACAAATCCCTCAACTAATCCACCTACAACAGAACCATCGCAACCC
SEQ2706 TCAAGTGAACCAGAAACAAATCCGTCAACTAATCCACCTACAACAGAACCATCGCAACCC
SEQ2707 TCAAGTGAACCAGAAACAAATCCCTCAACTAATCCACCTACAACAGAACCATCGCAACCC
SEQ2709 TCAAGTGAACCAGAAACAAATCCCTCAACTAATCCACCTACAACAGAACCATCGCAACCC
SEQ2710 TCAAGTGAACCAGAAACAAATCCCTCAACTAATCCACCTACAACAGAACCATCGCAACCC
SEQ2711 TCAAGTGAACCAGAAACAAATCCCTCAACTAATCCACCTACAACAGAACCATCGCAACCC



SEQ2701 TCACCTAGTGAAGAGAACAAGCCTGATGGTAGAACGAAGACAGAAATTGGCAATAATAAG
SEQ2702 TCACCTAGTGAAGAGAACAAGCCTGATGGTAGAACGAAGACAGAAATTGGCAATAATAAG
SEQ2703 TCACCTAGTGAAGAGAACAAGCCTGATGGTAGAACGAAGACAGAAATTGGCAATAATAAG
SEQ2704 TCACCTAGTGAAGAGAACAAGCCTGATGGTAGAACGAAGACAGAAATTGGCAATAATAAG
SEQ2705 TCACCTAGTGAAGAGAACAAGCCTGATGGTAGCACGAAGACAGAAATTGGCAATAATAAG
SEQ2706 TCACCTAGTGAAGAGAACAAGCCTGATGGTAGAACGAAGACAGAAATTGGCAATAATAAG
SEQ2707 TCACCTAGTGAAGAGAACAAGCCTGATGGGAGCACGAAGACAGAAATTGGCAATAATAAG
SEQ2709 TCACCTAGTGAAGAGAACAAGCCTGATGGTAGCACGAAGACAGAAATTGGCAATAATAAG
SEQ2710 TCACCTAGTGAAGAGAACAAGCCTGATGGGAGCACGAAGACAGAAATTGGCAATAATAAG
SEQ2711 TCACCTAGTGAAGAGAACAAGCCTGATGGGAGCACGAAGACAGAAATTGGCAATAATAAG



SEQ2701 GATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAA
SEQ2702 GATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAA
SEQ2703 GATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAA
SEQ2704 GATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAA
SEQ2705 GATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAA
SEQ2706 GATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAA
SEQ2707 GATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAA
SEQ2709 GATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAA
SEQ2710 GATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAA
SEQ2711 GATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAA



SEQ2701 
SEQ2702 
SEQ2703 
SEQ2704 
SEQ2705 
SEQ2706 
SEQ2707 
SEQ2709 
SEQ2710 
SEQ2711 



GCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAAGTGAT
GCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAAATGAT
GCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAAATGAT
GCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAAATGAT
GCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAAATGAT
GCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAAATGAT 
GCAAGTAGTGATCAAGAAGAAGTGGAACGCGATGAATCATCATCTTCAAAAGCAAATGAT 
GCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAAATGAT 
GCAAGTAGTGATCAAGAAGAAGTGGAACGCGATGAATCATCATCTTCAAAAGCAAATGAT
GCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAAATGAT 
SEQ2701
SEQ2702 
SEQ2703 
SEQ2704 
SEQ2705 
SEQ2706 
SEQ2707 
SEQ2709 
SEQ2710 
SEQ2711 

SEQ2701 
SEQ2702 
SEQ2703 
SEQ2704 
SEQ2705 
SEQ2706 
SEQ2707 
SEQ2709 
SEQ2710 
SEQ2711 

SEQ2701 
SEQ2702 
SEQ2703 
SEQ2704 
SEQ2705 
SEQ2706 
SEQ2707 
SEQ2709 
SEQ2710 
SEQ2711 

SEQ2701 
SEQ2702 
SEQ2703 
SEQ2704 
SEQ2705 
SEQ2706 
SEQ2707 
SEQ2709 
SEQ2710 
SEQ2711 

SEQ2701 
SEQ2702 
SEQ2703 
SEQ2704 
SEQ2705 
SEQ2706 
SEQ2707 
SEQ2709 
SEQ2710 
SEQ2711 



Table 27: Comparative Sequences relating to SAG1473 
(cell wall surface anchor family protein) 

GGGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

GGGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

GGGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA

GGGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

GAGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

GGGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA

GAGAAAAAAGGCCACAGTAAGCCTAAAAAGGAAS GATACAAGTGATAAGAATAC TGACAC 

GAGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

GAGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA

GAGAAAAAAGGC CACAGTAAGC CTAAAAAGGAATAB CMARAT VS TNC S RATNGT S AGCWA 



AGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATTAGATGAACTAGACCAGTCTAG



TRAC ANCHRAM YRTN • 



ACTGGTTCTTCTTCTGAAAATGAATCGAGTTCATCAAGTGAACCAGAAACAAATCCCTC 



ACTAATCCACCTACAACAGAACCATCGCAACCCTCACCTAGTGAAGAGAACAAGCCTGA



GGTAGCACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATT



SEQ2701 
SEQ2702 
SEQ2703 
SEQ2704 
SEQ2705
SEQ2706
SEQ2707
SEQ2709
SEQ2710 
SEQ2711 



ATTTCAGAAGATAGTATTAAGAATTTTAGTAAAGCAAGTAGTGATCAAGAARAAGTGGA



SEQ2701 
SEQ2702 
SEQ2703 
SEQ2704 
SEQ2705 
SEQ2706 
SEQ2707 
SEQ2709 
SEQ2710 
SEQ2711 



CGCGATGAATCATCATCTTCAAAAGCAAATGATGAGAAAAAAGGCCACAGTAAGCCTAA



SEQ2701 
SEQ2702 
SEQ2703 
SEQ2704 
SEQ2705 
SEQ2706 
SEQ2707 
SEQ2709 
SEQ2710 
SEQ2711 



AAGGAA 



>SEQ ID NO 2750:4_1169NT frame: 1

DTSDKNTDTSVVTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP
SPSEENKPDGRTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKASD
GKKGHSKPKKE

>SEQ ID NO 2751:4_18RS21 frame: 1

DTSDKNTDTSVVTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP
SPSEENKPDGRTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKAND
GKKGHSKPKKE

>SEQ ID NO 2752:4_2603 frame: 1 

DTSDKNTDTSVVTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP
SPSEENKPDGRTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKAND
GKKGHSKPKKE

>SEQ ID NO 2753:4^090 frame: 1 

DQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQPSPSEENKPDGRTKTEIGNNKDISSG 
TKVLISEDSIKNFSKASSDQEEVDRDESSSSKANDGKKGHSKPKKE

>SEQ ID NO 2754:4_A909 frame: 1 

DTSDKNTDTSVVTTTLSEEKRLDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP
SPSEENKPDGSTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKAND
EKKGHSKPKKE

>SEQ ID NO 2755:4_CJB110 frame: 1 

DTSDKNTDTSVVTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP
SPSEENKPDGRTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKAND
GKKGHSKPKKE

>SEQ ID NO 2756:4_COH1 frame: 1

DTSDKNTDTSVVTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP
SPSEENKPDGSTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVERDESSSSKAND
EKKGHSKPKKE

>SEQ ID NO 2757:4_H36B frame: 1

DTSDKNTDTSVVTTTLSEEKRLDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP
SPSEENKPDGSTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEXVDRDESSSSKAND
EKKGHSKPKKE 

>SEQ ID NO 2758:4_JM9130013 frame: 1

DTSDKNTDTSVVTTTLSEEKRLDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP
SPSEENKPDGSTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKAND
EKKGHSKPKKE 

>SEQ ID NO 2759:4_M732 frame: 1 

DTSDKNTDTSVVTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP
SPSEENKPDGSTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVERDESSSSKAND
EKKGHSKPKKE 

>SEQ ID NO 2760:4_M781 frame: 1

DTSDKNTDTSVVTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP
SPSEENKPDGSTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKAND
EKKGHSKPKKE 



SEQ2750 TSDKNTDTSVVTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP

SEQ2751 TSDKNTDTSVVTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP

SEQ2752 TSDKNTDTSVVTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP

SEQ2753 DQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP

SEQ2754 TSDKNTDTSVVTTTLSEEKRLDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP

SEQ2755 TSDKNTDTSVVTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP

SEQ2756 TSDKNTDTSVVTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP

SEQ2757 TSDKNTDTSVVTTTLSEEKRLDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP

SEQ2758 TSDKNTDTSVVTTTLSEEKRLDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP

SEQ2759 TSDKNTDTSVVTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP

SEQ2760 TSDKNTDTSVVTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP

SEQ2750 SPSEENKPDGRTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKASD

SEQ2751 SPSEENKPDGRTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKAND

SEQ2752 SPSEENKPDGRTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKAND

SEQ2753 SPSEENKPDGRTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKAND

SEQ2754 SPSEENKPDGSTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKAND

SEQ2755 SPSEENKPDGRTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKAND

SEQ2756 SPSEENKPDGSTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVERDESSSSKAND

SEQ2757 SPSEENKPDGSTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEXVDRDESSSSKAND

SEQ2758 SPSEENKPDGSTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKAND

SEQ2759 SPSEENKPDGSTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVERDESSSSKAND

SEQ2760 SPSEENKPDGSTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKAND

SEQ2750 KKGHSKPKKE 

SEQ2751 KKGHSKPKKE 

SEQ2752 KKGHSKPKKE 

SEQ2753 KKGHSKPKKE 

SEQ2754 KKGHSKPKKE 

SEQ2755 KKGHSKPKKE 

SEQ2756 KKGHSKPKKE 

SEQ2757 KKGHSKPKKE 

SEQ2758 KKGHSKPKKE 

SEQ2759 KKGHSKPKKE

SEQ2760 KKGHSKPKKE 
Table 28: Comparative Sequences relating to SAG1552
(conserved hypothetical protein) 

SEQ ID NO. 2801: SAG1552 FROM THE 1169NT1 GBS TYPE V STRAIN 
(REVERSE COMPLEMENT) 

TTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTTAAAGGAGTAGACGTTGAGTCT 
TCCTTAGCAGGTTATCATCACAACGATTTTCCTATTACTCAAAAAACGTATCGTGAGTGGTTCCATTTAATTTCCAAC 
ATGGGGGCAAATACTGTAAGAGTCAAAGTACCGATGAATGTTGCATTTTACGATGCTTTATATCACCACAACAAAGCA 
TCAAAGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCTATAACAGCTTTTAAT 
GATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTGGATATTCTCCATGGGCGTAAGCAAGTATGGAAT
ACTGATTTTGGTAGCCGTCATTATCATTATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAAT 
AGTGGTACTGTCGCTTATACTAATCATCAAGAGAAAAAAACGCAATATAAAGGACGTTATTTTAAAACTTCTGCGGCA 
GCTAATCCATTTGAGGTCATGCTAGCTCAAGTTATGGATGAATTGACACATTATGAGACAGCTAAATATGGTTGGCAA
CATTTGATTAGTTTTTCAAACTCACCAACAACAGACCCTTTTCGTTATCGAAAACCATTTGAGGCACAGGCTCCTAAA 
TACGTACAACTAAATGTAGAAAATATTCAAGCTAATTCGAATGTTAAAGCAGGTATTTTTGCAGCATATAAAGCTATT 
GATTTCCATCCTCGATACAAGGATTATCTATTATTTGATAAAGAGAATATCAGTAAAGAAGATAGACAAAAGATTAAA 
GAACTTTCTTTGTCACAGGGATACGTTAAACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGC 
TATTCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGAAAAAGAACAAGGTCAG
CGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTTTGGAGCGACTATCAATGCATGGCAAGACGATTGG
AATGCAAGGGCGTGGAATACATCCTTCGCCACAAATAAACATAGTCAATTCCTATGGGGGGATGCACAAGTATTTAAT
CAAGGTTATGGTTTATTAGGCTTTAAAAACGCAAAACATCATTATCAAGTTGATGGTAAAAGAGGCAAAGGAGAGTGG
AAACATCCTCTG 

SEQ ID NO. 2802: SAG1552 FROM THE 

ATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAGTGATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCT 
GAAAAACTAAAAGAAAAACGATTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAG
GTCACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATTATTTGTCCAAGAGCGC 
TATAATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACGGTAAAGATTTTTATGCTTTCCCACCAAAGAAGAACAGT
AGTAATTTTGAGCAGATCAATATGGTATTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAG 
AGGTTCTTACCAACTCATCCTACTGGTCTTCTCAAAACAGGAACAATTGATAGGCACCAAAAAACATTTGATTCACAA 
ACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCAGTTGTTGAATTTTTCTGATCCATCATCT
CAAAAAATTCACGATGATTACTTTAAACATTATGGTGTGAAGGAGTTAGAAATTGAGAGCATTGCTTTAGGATTAGGT
GCTAATAGCAAAGAAAACAGACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACCAAAACC
TTTTTAAAAGACTCCTATTATAGTATTTAAGAAAGAA

SEQ ID NO. 2803: SAG1552 FROM THE 18RS21 GBS TYPE II STRAIN 

AAGGGCTTATTAAAAGAAAATACAAGAACTAACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAA 
CCTTTTGTTGTTAAAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACTCAAAAA 
ACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGAGTCAAGGTACCGATGAATGTTGCA 
TTTTACGATGCCTTATATCACCACAACAAAGCATCAAAGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCT 
TATCGCAATAATGCTTCTATAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 
GATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTGGGTAGCCGTCATTATCATTATGATCTTAGTCCTTGG 
GTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGTACTGTCGCTTATACTAATCATCAAGAGAAAAAAACGCAA
TATAAAGGACGTTATTTTAAAACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGATGAATTG
ACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCAACAACAGACCCTTTTCAT 
TATCGAAAACCATTTGAGGCACAGGCTCCTAAATACGTACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTT 
AAAGCAGGTATGTTTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATAAAGAG
AATATCAGTAAAGAAGATAGACAAAAGATTAAAGAACTTTCTTTGTCACAGGGATACGTTAAACTGCTAAATGCTTAT
CACAAAATCCCTGTTCTAGTCACGGGTTATGGCTATTCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGT 
CCTCTGCCGATTAATGAAAAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTTT
GGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTTCGCCACAAATAAACATAGT 
CAATTCCTATGGGGGGATGCACAAGTATTTAATCAAGGTTATGGTTTATTAGGCTTTAAAAACGCAAAACATCATTAT 
CAAGTTGATGGTAAAAGAGGCAAAGGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCT 
AGCAGTGATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACGATTATTACCAATA 
GATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGTCACATTTTCTAAATCTAGTGACTTTGTATTG 
TCTATTGATCCAAATGGCAAGTCTGAATTATTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAG
CTTAACGGTAAAGATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGTATTGAGA
AATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTTACCAACTCATCCTACTGGTCTTCTC 
AAAACAGGAACAACTGATAGGCACCAAAAAACATTTGATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAG
GTCAGAATTCCGTGGCAGTTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTAT 
GGTGTGAAGGAGTTAGAAATTGAGAGCATTGCTTTAGGATTAGGTGCTAATAGCAAAGAAAACACACTGATAAAGATG 
GCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACCAAAACCTTTTTAAAAGACTCCTATTATGTATTAAGAAAG
AA 

SEQ ID NO. 2804: SAG1552 FROM THE 2603 V/R GBS TYPE V STRAIN
(REVERSE COMPLEMENT) 

TATTAAAAGAAAATACAAGAACTAACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTG
TTGTTAAAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACTCAAAAAACGTATC 
GTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGAGTCAAGGTACCGATGAATGTTGCATTTTACG 
ATGCCTTATATCACCACAACAAAGCATCAAAGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCA 
ATAATGCTTCTATAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTGGATATTC 
TCCATGGGCGTAAGCAAGTATGGAATACTGATTTGGGTAGCCGTCATTATCATTATGATCTTAGTCCTTGGGTACTTG 
GTTATGTCGTAGGGGATGATTGGAATAGTGGTACTGTCGCTTATACTAATCATCAAGAGAAAAAAACGCAATATAAAG 
GACGTTATTTTAAAACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGATGAATTGACACATT 
ATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCAACAACAGACCCTTTTCATTATCGAA
AACCATTTGAGGCACAGGCTCCTAAATACGTACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTTAAAGCAG
GTATGTTTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATAAAGAGAATATCA 
GTAAAGAAGATAGACAAAAGATTAAAGAACTTTCTTTGTCACAGGGATACGTTAAACTGCTAAATGCTTATCACAAAA 
TCCCTGTTCTAGTCACGGGTTATGGCTATTCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGC 
CGATTAATGAAAAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTTTGGAGCGA 
CTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTTCGCCACAAATAAACATAGTCAATTCC 
TATGGGGGGATGCACAAGTATTTAATCAAGGTTATGGTTTATTAGGCTTTAAAAACGCAAAACATCATTATCAAGTTG 
ATGGTAAAAGAGGCAAAGGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAGTG 
ATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACGATTATTACCAATAGATATTA
CACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGTCACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTG 
ATCCAAATGGCAAGTCTGAATTATTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACG 
GTAAAGATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGTATTGAGAAATACAA 
AGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTTACCAACTCATCCTACTGGTCTTCTCAAAACAG
GAACAACTGATAGGCACCAAAAAACATTTGATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAA 
TTCCGTGGCAGTTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTATGGTGTGA 
AGGAGTTAGAAATTGAGAGCATTGCTTTAGGATTAGGTGCTAATAGCAAAGAAAACACACTGATAAAGATGGCAGATT 
ATCGTTTGAAAAATTGGGAGAGACCCGATACCAAAACCTTTTTAAAAGACTCCTATTATAGTATTAAGAAAGAATGGT 
CTAAAGAAAGAGAGAGAACATATGGTCCA 

SEQ ID NO. 2805: SAG1552 FROM THE A909 GBS TYPE Ia STRAIN
(REVERSE COMPLEMENT) 

AAGGGCTTATTAAAAGAAAATACAAGAACTAACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAA
CCTTTTGTTGTTAAAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACTCAAAAA 
ACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGAGTCAAGGTACCGATGAATGTTGCA 
TTTTACGATGCCTTATATCACCACAACAAAGCATCAAAGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCT 
TATCGCAATAATGCTTCTATAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 
GATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTGGGTAGCCGTCATTATCATTATGATCTTAGTCCTTGG 
GTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGTACTGTCGCTTATACTAATCATCAAGAGAAAAAAACGCAA 
TATAAAGGACGTTATTTTAAAACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGATGAATTG 
ACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCAACAACAGACCCTTTTCAT 
TATCGAAAACCATTTGAGGCACAGGCTCCTAAATACGTACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTT
AAAGCAGGTATGTTTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATAAAGAG 
AATATCAGTAAAGAAGATAGACAAAAGATTAAAGAACTTTCTTTGTCACAGGGATACGTTAAACTGCTAAATGCTTAT 
CACAAAATCCCTGTTCTAGTCACGGGTTATGGCTATTCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGT 
CCTCTGCCGATTAATGAAAAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTTT
GGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTTCGCCACAAATAAACATAGT
CAATTCCTATGGGGGGATGCACAAGTATTTAATCAAGGTTATGGTTTATTAGGCTTTAAAAACGCAAAACATCATTAT
CAAGTTGATGGTAAAAGAGGCAAAGGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCT
AGCAGTGATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACGATTATTACCAATA 
GATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGTCACATTTTCTAAATCTAGTGACTTTGTATTG 
TCTATTGATCCAAATGGCAAGTCTGAATTATTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAG 
CTTAACGGTAAAGATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGTATTGAGA
AATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTTACCAACTCATCCTACTGGTCTTCTC 
AAAACAGGAACAACTGATAGGCACCAAAAAACATTTGATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAG 
GTCAGAATTCCGTGGCAGTTGTTGAATTTTTCTGATCCATCATCTCAAAGAATTCACGATGATTACTTTAAACATTAT 
GGTGTGAAGGAGTTAGAAAATTGAGAGCCATTGCTTTAGGATTAGGTGCTAATAGCAAAGAAAACACACTGATAAAGA 
TGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACCAAAACCTTTTTAAAAGA 

SEQ ID NO. 2806: SAG1552 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

TATTACTTTGATGGTAGTTTGTATTTACCAAAGGGCTTATTAAAAGAAAATACAAGAACTAACTTTGTTGTTAAAGGT 

GATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTTAAAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTAT 

CATCACAACGATTTTCCTATTACTCAAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACT 

GTAAGAGTCAAGGTACCGATGAATGTTGCATTTTACGATGCCTTATATCACCACAACAAAGCATCAAAGAGGCCACT 

TATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCTATAACAGCTTTTAATGATAATTATAGGGGG

TATTTAAAACGAGAAGCAAAAGGCGTTGTGGATATTCTCCATGGGCGTAAGCAAGTATGGAATACAGATTTTGGTAGC 

CGTCATTATCATTATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGTACTGTCGCT 

TATACTAATCATCAAGAGAAAAAAACGCAATATAAAGGACGTTATTTTAAAACTTCTGTGGCAGCTAATCCATTTGAG 

GTCATGCTAGCTCAAGTAATGGATGAATTGACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTT 

TCAAACTCACCAACAACAGACCCTTTTCATTATCGAAAACCATTTGAGGCACAGGCTCCTAAATACGTACAACTAAAT 

GTAGAAAATATTCAAGCTAATTCAAATGTTAAAGCAGGTATGTTTGCAGCATATAAAGCTATTGATTTCCATCCTCGA 

TACAAGGATTATCTATTATTTGATAAAGAGAATATCAGTAAAGAAGATAGACAAAAGATTAAAGAACTTTCTTTGTCA 

CAGGGATACGTTAAACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTATTCGACAGCGAGA

GGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGAAAAAGAACAAGGTCAGCGTTTACTAGAAGAT

TATGAATCTTTTATATCATCCGGTAGTTTTGGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGG 

AATACATCTTTCGCCACAAATAAACATAATCAATTCCTATGGGGGGATGCACAAGTATTTAATCAAGGTTATGGTTTA

TTAGGCTTTAAAAACGCAAAACATCATTATCAAGTTGATGGTAAAAGAGGCAAAGGAGAGTGGAAACATCCTCTGATG

ACTAGTGCAACAGGAGATGACTTATATGCTAGCAGTGATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAA 

AAACTAAAAGAAAAACGATTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGTC 

ACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATTATTTGTCCAAGAGCGCTAT 

AATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACGGTAAAGATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGT

AATTTTGAGCAGATAAATATGGTATTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGG 

TTCTTACCAACTCATCCTACTGGTCTTCTCAAAACAGGAACAACTGATAGGCACCAAAAAACATTTGATTCACAAACA

GATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCAGTTGTTGAATTTTTCTGATCCATCATCTCAA 

AAAATTCACGATGATTACTTTAAACATTATGGTGTGAAGGAGTTAGAAATTGAGAGCATTGCTTTAGGATTAGGTGCT 

AATAGCAAAGAAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACCAAAACCTTT 

TTAAAAGACTCCTATTATGTATTAAGAAAGA

SEQ ID NO. 2807: SAG1552 FROM THE COH1 GBS TYPE III STRAIN 

TTTACCACAGGGCTTATTAAAAGAAAATACAAGAACTAACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCAC 
CAATAAACCTTTTGTTGTTAAAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTAC 
TCAAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGAGTCAAGGTACCGATGAA
TGTTGCATTTTACGATGCCTTATATCACCACAACAAAGAATCAAAGAGGCCACTGTATTTGTTGCAAGGAATACGTAT 
AGATTCTTATCGCAATAATGCTTCTATAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGG 
CGTTGTGGATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTTGGTAGCCGTCATTATCATTATGATCTTAG
TCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGTACTGTCGCTTATACTAATCATCAAGAGAAAAA 
AACGCAATATAAAGGACGTTATTTTAAAACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGA 
TGAATTGACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCAACAACAGACCC 
TTTTCATTATCGAAAACCATTTGAGGCACAGGCTCCTAAATACGTACAACTAAATGTAGAAAATATTCAAGCTAATTC
AAATGTTAAAGCAGGTATGTTTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGA 
TAAAGAGAATATCAGTAAAGAAGATAGACAAAAGATTAAAGAACTTTCTTTGTCACAGGGATACGTTAAACTGCTAAA
TGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTATTCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGA
TAAACGTCCTCTGCCGATTAATGAAAAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGG
TAGTTTTGGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTTCGCCACAAATAA 
ACATAGTCAATTCCTATGGGGGGATGCACAAGTATTTAATCAAGGTTATGGTTTATTAGGCTTTAAAAACGCAAAACA 
TCATTATCAAGTTGATGGTAAAAGAGGCAAAGGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTT 
ATATGCTAGCAGTGATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACGATTATT 
ACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGTCACATTTTCTAAATCTAGTGACTT 
TGTATTGTCTATTGATCCAAATGGCAAGTCTGAATTATTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCT 
TCGACAGCTTAACGGTAAAGATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGT 
ATTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTTACCAACTCATCCTACTGG 
TCTTCTCAAAACAGGAACAACTGATAGGCACCAAAAAACATTTGATTCACAACCAGATATTTCGTTTGGAAAGGACTT 
TATAGAGGTCAGAATTCCGTGGCAGTTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAA
ACATTATGGTGTGAAGGAGTTAGAAATTGAGAGCATTGCTTTAGGATTAGGTGCTAATAGCAAAGAAAACACACTGAT 
AAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACCAAAACCTTTTTAAAAGACT

SEQ ID NO. 2808: SAG1552 FROM THE H36b GBS TYPE Ib STRAIN

AAGGGGCTTATTAAAAGAAAATACAAGAACTAACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAA
ACCTTTTGTTGTTAAAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACTCAAAA 
AACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGAGTCAAGGTACCGATGAATGTTGC 
ATTTTACGATGCCTTATATCACCACAACAAAGCATCAAAGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTC 
TTATCGCAATAATGCTTCTATAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGT 
GGATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTTGGTAGCAGTCATTATCATTATGATCTTAGTCCTTG
GGTACTTGGTTATGTCGTAGGGGATGATGGACATAGTGGTACTGTCGCTTTATACTAATCATCAAGAGGAGAAAAACG 
CAATATAAAGGACGTTATTTTAAAACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGATGAA
TTGACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCAACAACAGACCCTTTT 
CATTATCGAAAACCATTTGAGGCACAGGCTCCTAAATACGTACAACTAAATGTAGAAAATATTCAAGCTAATTCGAAT 
GTTAAAGCAGGTATGTTTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATAAA 
GAGAATATCAGTAAAGAAGATAGACAAAAGATTAAAGAACTTTCTTTGTCACAGGGATACGTTAAACTGCTAAATGCT 
TATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTACTCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAA
CGTCCTCTGCCGATTAATGAAAAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGT
TTTGGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGTGTGGAATACATCCTTCGCCACAAATAAACAT
AGTCAATTCCTATGGGGGGATGCACAAGTATTTAATCAAGGTTATGGTTTATTAGGCTTTAAAAACGCAAAACATCAT
TATCAGGTTGATGGTAAAAGAGGCAAAGAAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATAT
GCTAGCAGTGATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACGATTATTACCA
ATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGTCACATTTTCTAAATCTAGTGACTTTGTA 
TTGTCTATTGATCCAAATGGCAAGTCTGAATTATTTGTCCAAGAGCGCTATAACGCCTTAAAAGCGAACTATCTTCGA 
CAGCTTAATGGTAAAGATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGTATTG
AGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTTACCAACTCATCCTACTGGTCTT 
CTCAAAACAGGAACAACTGATAGGCACCAAAAAACATTTGATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATA 
GAGGTCAGAATTCCGTGGCAGTTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACAT
TATGGTGTGAAGGAGTTAGAAATTGAGAGCATTGCTTTAGGATTAGGTGCTAATAGCAAAGAAAACACACTGATAAAG
ATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACCAAAACCTTTTTAAAAGACTCCTATTATAGT 

SEQ ID NO. 2809: SAG1552 FROM THE JM9130013 GBS TYPE VIII STRAIN 

ACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTTAAAGGAGTAGACGTTGAGT 
CTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACTCAAAAAACGTATCGTGAATGGTTCCATTTAATTTCCA 
ACATGGGGGCAAATACTGTAAGAGTCAAGGTACCGATGAATGTTGCATTTTACGATGCCTTATATCACCACAACAAAG 
CATCAAAGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCTATAACAGCTTTTA 
ATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTGGATATTCTCCATGGGCGTAAGCAAGTATGGA 
ATACTGATTTTGGTAGCAGTCATTATCATTATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGA 
ATAGTGGTACTGTCGCTTATACTAATCATCAAGAGAAAAAAACGCAATATAAAGGACGTTATTTTAAAACTTCTGTGG 
CAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGATGAATTGACACATTATGAGACAGCTAAATATGGTTGGC 
AACATTTGATTAGTTTTTCAAACTCACCAACAACAGACCCTTTTCATTATCGAAAACCATTTGAGGCACAGGCTCCTA 
AATACGTACAACTAAATGTAGAAAATATTCAAGCTAATTCGAATGTTAAAGCAGGTATGTTTGCAGCATATAAAGCTA 
TTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATAAAGAGAATATCAGTAAAGAAGATAGACAAAAGATTA
AAGAACTTTCTTTGTCACAGGGATACGTTAAACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATG 
GCTACTCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGAAAAAGAACAAGGTC 
AGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTTTGGAGCGACTATCAATGCATGGCAAGACGATT 
GGAATGCAAGGGTGTGGAATACATCCTTCGCCACAAATAAACATAGTCAATTCCTATGGGGGGATGCACAAGTATTTA
ATCAAGGTTATGGTTTATTAGGCTTTAAAAACGCAAAACATCATTATCAGGTTGATGGTAAAAGAGGCAAAGAAGAGT 
GGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAGTGATGAAAGCTATCTCTACCTTGCGA
TTAAAACAAAACCTGAAAAACTAAAAGAAAAACGATTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAA
TGAATGGTAGTAAGGTCACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATTAT
TTGTCCAAGAGCGCTATAACGCCTTAAAAGCGAACTATCTTCGACAGCTTAATGGTAAAGATTTTTATGCTTTCCCAC 
CAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGTATTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAG 
TAAAAGCAACAGAGAGGTTCTTACCAACTCATCCTACTGGTCTTCTCAAAACAGGAACAACTGATAGGCACCAAAAAA 
CATTTGATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCAGTTGTTGAATTTTT 
CTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTATGGTGTGAAGGAGTTAGAAATTGAGAGCATTG
CTTTAGGATTAGGTGCTAATAGCAAAGAAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGAC 
CCGATACCAAAACCTTTTTAAAAGACTCCTATTATAGTATTAAGAAAG

SEQ ID NO. 2810: SAG1552 FROM THE M732 GBS TYPE III STRAIN 

TACAAGAACTAACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTTAAAGGAGT
AGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACTCAAAAAACGTATCGTGAATGGTTCCA 
TTTAATTTCCAACATGGGGGCAAATACTGTAAGAGTCAAGGTACCGATGAATGTTGCATTTTACGATGCCTTATATCA
CCACAACAAAGAATCAAAGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCTAT
AACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTGGATATTCTCCATGGGCGTAA 
GCAAGTATGGAATACTGATTTTGGTAGCCGTCATTATCATTATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGG
GGATGATTGCAATAGTGGTACTGTCGCTTATACTAATCATCAAGAGAAAAAAACGCAATATAAAGGACGTTATTTTAA 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGATGAATTGACACATTATGAGACAGCTAA 
ATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCAACAACAGACCCTTTTCATTATCGAAAACCATTTGAGGC 
ACAGGCTCCTAAATACGTACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTTAAAGCAGGTATGTTTGCAGC 
ATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATAAAGAGAATATCAGTAAAGAAGATAG 
ACAAAAGATTAAAGAACTTTCTTTGTCACAGGGATACGTTAAACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGT 
CACGGGTTATGGCTATTCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGAAAA 
AGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTTTGGAGCGACTATCAATGCATG
GCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTTCGCCACAAATAAACATAGTCAATTCCTATGGGGGGATGC 
ACAAGTATTTAATCAAGGTTATGGTTTATTAGGCTTTAAAAACGCAAAACATCATTATCAAGTTGATGGTAAAAGAGG 
CAAAGGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAGTGATGAAAGCTATCT 
CTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACGATTATTACCAATAGATATTACACCAAAATCTGG 
TAGTAGAAAAATGAATGGTAGTAAGGTCACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAA
GTCTGAATTATTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACGGTAAAGATTTTTA 
TGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGTATTGAGAAATACAAAGATTGTTGAAGA 
CATGGAAAAAGTAAAAGCAACAGAGAGGTTCTTACCAACTCATCCTACTGGTCTTCTCAAAACAGGAACAACTGATAG
GCACCAAAAAACATTTGATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCAGTT 
GTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTATGGTGTGAAGGAGTTAGAAAT
TGAGAGCATTGCTTTAGGATTAGGTGCTAATAGCAAAGAAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAA 
TTGGGAGAGACCCGATACCAAAACCTTTTTAAAAGACTCCTATTATAGTATTAAG 

SEQ ID NO. 2811: SAG1552 FROM THE M781 GBS TYPE III STRAIN 

TTTGATGGTAGTTTGTATTTACCACAGGGCTTATTAAAAGAAAATACAAGAACTAACTTTGTTGTTAAAGGTGATACT
GTACTTCACAAGCCCACCAATAAACCTTTTGTTGTTAAAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCAC 
AACGATTTTCCTATTACTCAAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA 
GTCAAGGTACCGATGAATGTTGCATTTTACGATGCCTTATATCACCACAACAAAGAATCAAAGAGGCCACTGTATTTG 
TTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCTATAACAGCTTTTAATGATAATTATAGGGGGTATTTA
AAACGAGAAGCAAAAGGCGTTGTGGATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTTGGTAGCCGTCAT 
TATCATTATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGTACTGTCGCTTATACT
AATCATCAAGAGAAAAAAACGCAATATAAAGGACGTTATTTTAAAACTTCTGTGGCAGCTAATCCATTTGAGGTCATG 
CTAGCTCAAGTAATGGATGAATTGAGACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAAC 
TCACCAACAACAGACCCTTTTCATTATCGAAAACCATTTGAGGCACAGGCTCCTAAATACGTACAACTAAATGTAGAA
AATATTCAAGCTAATTCAAATGTTAAAGCAGGTATGTTTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAG
GATTATCTATTATTTGATAAAGAGAATATCAGTAAAGAAGATAGACAAAAGATTAAAGAACTTTCTTTGTCACAGGGA 
TACGTTAAACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTATTCGACAGCGAGAGGTATT
GCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGAAAAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAA
TCTTTTATATCATCCGGTAGTTTTGGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACA 
TCTTTCGCCACAAATAAACATAGTCAATTCCTATGGGGGGATGCACAAGTATTTAATCAAGGTTATGGTTTATTAGGC 
TTTAAAAACGCAAAACATCATTATCAAGTTGATGGTAAAAGAGGCAAAGGAGAGTGGAAACATCCTCTGATGACTAGT
GCAACAGGAGATGACTTATATGCTAGCAGTGATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTA
AAAGAAAAACGATTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGTCACATTT 
TCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATTATTTGTCCAAGAGCGCTATAATGCC 
TTAAAAGCGAACTATCTTCGACAGCTTAACGGTAAAGATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTT
GAGCAGATAAATATGGTATTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTTA 
CCAACTCATCCTACTGGTCTTCTCAAAACAGGAACAACTGATAGGCACCAAAAAACATTTGATTCACAAACAGATATT
TCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCAGTTGTTGAATTTTTCTGATCCATCATCTCAAAAAATT 
CACGATGATTACTTTAAACATTATGGTGTGAAGGAGTTAGAAATTGAGAGCATTGCTTTAGGATTAGGTGCTAATAGC 
AAAGAAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACCAAAACCTTTTTAAAA 
GACTCCTATTATAGTATTAAGAAAGAATGG



SEQ2801 

SEQ2802 

SEQ2803 AAGGGCTTATTAAAAGAAAATACAAGAACT

SEQ2804 TATTAAAAGAAAATACAAGAACT

SEQ2805 AAGGGCTTATTAAAAGAAAATACAAGAACT

SEQ2806 ATTACTTTGATGGTAGTTTGTATTTACCAAAGGGCTTATTAAAAGAAAATACAAGAACT

SEQ2807 TTTACCACAGGGCTTATTAAAAGAAAATACAAGAACT

SEQ2808 AAGGGGCTTATTAAAAGAAAATACAAGAACT

SEQ2809

SEQ2810 TACAAGAACT

SEQ2811 TTTGATGGTAGTTTGTATTTACCACAGGGCTTATTAAAAGAAAATACAAGAACT

SEQ2801 --TTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTT 

SEQ2802 

SEQ2803 ACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTT

SEQ2806 ACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTT

SEQ2807 ACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTT

SEQ2808 ACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTT

SEQ2809 ACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTT

SEQ2810 ACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTT

SEQ2811 ACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTT 

SEQ2801 AAGGAGTAGACGTTGAGTCTTCCTTAGCAGGTTATCATCACAACGATTTTCCTATTACT

SEQ2802

SEQ2803 AAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACT

SEQ2804 AAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACT

SEQ2805 AAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACT

SEQ2806 AAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACT

SEQ2807 AAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACT

SEQ2808 AAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACT

SEQ2809 AAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACT

SEQ2810 AAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACT

SEQ2811 AAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACT

SEQ2801 AAAAAACGTATCGTGAGTGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA

SEQ2802

SEQ2803 AAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA

SEQ2804 AAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA

SEQ2805 AAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA

SEQ2806 AAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA

SEQ2807 AAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA

SEQ2808 AAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA

SEQ2809 AAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA

SEQ2810 AAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA

SEQ2811 AAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA

SEQ2 801 TCAAAGTACCGATGAATGTTGCATTTTACGATGCTTTATAT(^CCACAACAAAGCATaA 

SEQ2802 

SEQ2 803 TC^U^GGTACCGATGAATGTTGCATTTTACGATGCCTTATATCAC 

SEQ2 804 TCAAGGTACCGATGAATGTTGCATTTTACGATGCCTTATATCACC^ 

SEQ2 805 TCAAGGTACCGATGAATGTTGCATTTTACGATGCCTTATATCACC^CAACAAAG(^TCA 

SEQ2806 TCAAGGTACCGATGAATGTTGGATTTTACGATGCCTTATATCACCACAAC^

SEQ2 807 TC^GGTACCGATGAATGTTGCATTTTACGATGCCTTATATCACCACAACAAAGAATCA 

SEQ2808 TCAAGGTACCGATGAATGTTGC^TTTTACGATGCCTTATATCACCACAAC^^GC^TCA

SEQ2 809 TCAAGGTACCGATGAATGTTGCATTTTACGATGCCTTATATCACCACMCAAAGC^TCA 

SEQ2 810 TCAAGGTACCGATGAATGTTGC^TTTTACGATGCCTTATATC^CCACAACAAAGAATCA 

SEQ2 811 TCAAGGTACCGATGAATGTTGCATTTTACGATGCCTTATATCACCACAAC^AGAATCA 

SEQ2 801 AGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCT 

SEQ2802 

SEQ2 803 AGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCT 

SEQ2804 AGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCT

SEQ2805 AGAGGCGACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCT

SEQ2806 AGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCT 

SEQ2 807 AGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCT 

SEQ2 808 AGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCT 
Table 28: Comparative Sequences relating to SAG1552
(conserved hypothetical protein) 



SEQ2 809 
SEQ2810 
SEQ2811 

SEQ2801 
SEQ2802 
SEQ2803 
SEQ2804 
SEQ2805 
SEQ2806 
SEQ2807 
SEQ2808 
SEQ2 809 
SEQ2810 
SEQ2811 



AGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCT 
AGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCT 
AGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCT 

TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 

TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 
TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 
TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 
TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 
TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 
TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 
TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 
TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 
TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 



SEQ2801 
SEQ2802 
SEQ2803 
SEQ2804 
SEQ2805 
SEQ2806 
SEQ2807 
SEQ2808
SEQ2809 
SEQ2 810 
SEQ2811 



ATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTTGGTAGCCGTCATTATCAT 

ATGACTA - GT G CAA CAGGAGATGAC T TAT AT - GCTAGCAGTGATGAAAGC 

ATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTGGGTAGCCGTCATTATCAT 
ATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTGGGTAGCCGTCATTATCAT 
ATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTGGGTAGCCGTCATTATCAT 
ATATTCTCCATGGGCGTAAGCAAGTATGGAATACAGATTTTGGTAGCCGTCATTATCAT 
ATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTTGGTAGCCGTCATTATCAT 
ATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTTGGTAGCAGTCATTATCAT 
ATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTTGGTAGCAGTCATTATCAT 
ATATTCTC CATGGGCGTAAGCAAGTATGGAATACTGATTTTGGTAG CCGTCATTATCAT 
ATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTTGGTAGCCGTCATTATCAT 



SEQ2 801 
SEQ2802 
SEQ2803 
SEQ2804 
SEQ2 805 
SEQ2806 
SEQ2807 
SEQ2808 
SEQ2809 
SEQ2810 
SEQ2811 



TATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGT-AC 
TAT- -CTCTA- - CCTTGCG- ATTAAAACAAAACCTGAAAAACTAAAAGAAAAACGATTAT 
TATGATCTTAGTCCTTGGGTACTTGGT TATGTCGTAGGGGATGATTGGAATAGTGGT - AC 
TATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGT-AC 
TATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGT-AC 
TATGATCTTAGTCCTTGGGTACTTGGT TATGTCGTAGGGGATGATTGGAATAGTGGT - AC 
TATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGT - AC 
TATGATCTTAGTCCTTGGGTACTTGGTTATGT CGTAGGGGATGATGGACATAGTGGT - AC 
TATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGT-AC 
TATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGCAATAGTGGT-AC 
TATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGT-AC 



SEQ2801 
SEQ2802 
SEQ2803 
SEQ2804 
SEQ2 805 
SEQ2806 
SEQ2807 
SEQ2808 
SEQ2809 
SEQ2810 
SEQ2811 



TGTCGCTT - ATACTAATCATCAAGAGAA- AAAAACGCAATATAAAGGAC - GTTATTTTAA 
TACCAATAGATATTA- - CACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGTCAC 
TGTCGCTT - AT AC TAAT CATCAAGAGAA - AAAAA CG CAATATAAAG GA C - GTTATTTTAA 
TGTCGCTT - ATAC TAAT CAT CAAGAGAA -AAAAACGCAATATAAAGGAC- GTTATTTTAA 
TGTCGCTT - ATACTAATCATCAAGAGAA- AAAAACGCAATATAAAGGAC - GTTATTTTAA 
TGTCGCTT - ATACTAATCATCAAGAGAA- AAAAACGCAATATAAAGGAC - GTTATTTTAA 
TGTCGCTT - ATACTAATCATCAAGAGAA- AAAAACGCAATATAAAGGAC - GTTATTTTAA 
TGTCGCTTTATACTAATCATCAAGAGGAGAAAAACGCAATATAAAGGAC - GTTATTTTAA 
TGTCGCTT -ATACTAATCATCAAGAGAA- AAAAACGCAATATAAAGGAC -GTTATTTTAA 
TGTCGCTT - ATACTAATCATCAAGAGAA- AAAAACGCAATATAAAGGAC - GTTATTTTAA 
TGTCGCTT - ATACTAATCATCAAGAGAA- AAAAACGCAATATAAAGGAC - GTTATTTTAA 



SEQ2 801 
SEQ2 802 
SEQ2803 
SEQ2804 
SEQ2805 
SEQ2806 
SEQ2807 
SEQ2808
SEQ2809 
SEQ2 810 
SEQ2811 

SEQ2801 
SEQ2802 
SEQ2803 
SEQ2804 



AACTTCTGCGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTTATGGAT - -GAATTG 
ATTTTCTAAATCTAGTGA- CTTTGTATTGTC- TATTGATCCAAATGGCAAGTCTGAATT- 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGAT - - GAATTG 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGAT - - GAATTG 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGAT - - GAATTG 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGAT - -GAATTG 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGAT- -GAATTG 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGAT - - GAATTG 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGAT - - GAATTG 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGAT- -GAATTG 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGAT - - GAATTG 

ACA(^TTATGAGACAGCTAAATATGGTTGGC^ACATTTGATTAGTTTTTCAAACTCACCA 
ATTTGTC - CAAGAGCGCTATA - ATGCCT - - TAAAAGCGAACTATCTTCGACAGCTTAACG 
ACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCA 
ACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCA 
SEQ2805 ACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCA

SEQ2 806 AC^CATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTC^AACTCACCa 

SEQ2 807 ACACATTATGAGACAGCTAAATATGGTTGGC^C^TTTGATTAGTTTTTCAAACTCACC^ 

SEQ2 808 ACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCA 

SEQ2 809 AC^CATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCA 

SEQ2810 ACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCA 

SEQ2 811 AC^CATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCA 

SEQ2 801 CAACAGAC CCTTTTCGTTATCGAAAACCATTTG- AGGCACAGGCTCCTAAATA 

SEQ2 802 TAAAGATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATCAATA 

SEQ2803 CAACAGAC CCTTTTCATTATCGAAAACCATTTG -AGGCACAGGCTCCTAAATA 

SEQ2804 CAACAGAC CCTTTTCATTATCGAAAACCATTTG- AGGCACAGGCTCCTAAATA

SEQ2 805 CAACAGAC CCTTTTCATTATCGAAAACCATTTG- AGGCACAGGCTCCTAAATA 

SEQ2 806 CAACAGAC CCTTTTCATTATCGAAAACCATTTG-AGGCACAGGCTCCTAAATA 

SEQ2 807 CAACAGAC CCTTTTCATTATCGAAAACCATTTG- AGGCACAGGCTCCTAAATA 

SEQ2 808 CAACAGAC CCTTTTCATTATCGAAAACCATTTG- AGGCACAGGCTCCTAAATA 

SEQ2809 CAACAGAC CCTTTTCATTATCGAAAACCATTTG-AGGCACAGGCTCCTAAATA 

SEQ2 810 CAACAGAC CCTTTTCATTATCGAAAACCATTTG - AGGCACAGGCTCCTAAATA 

SEQ2811 CAACAGAC CCTTTTCATTATCGAAAACCATTTG-AGGCACAGGCTCCTAAATA 

SEQ2 801 G-TACAACTAAATGTAGAAAATATTCAAGCTAATTCGAATGTTAAAGCA GGTATTT 

SEQ2 802 GGTATTGAGAAATACAAAG^TTGTTGAAGACATGGAAAAAGTAAAAGC^C^GAGAGGT 

SEQ2 803 G-TACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTTAAAGCA GGTATGT 

SEQ2804 G-TACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTTAAAGCA GGTATGT

SEQ2805 G-TACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTTAAAGCA GGTATGT 

SEQ2806 G-TACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTTAAAGCA GGTATGT 

SEQ2 807 G - TACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTTAAAGCA GGTATGT 

SEQ2 808 G - TACAACTAAATGTAGAAAATATTCAAGCTAATTCGAATGTTAAAGCA GGTATGT 

SEQ2 809 G - TACAACTAAATGTAGAAAATATTCAAGCTAATTCGAATGTTAAAGCA GGTATGT 

SEQ2 810 G- TACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTTAAAGCA GGTATGT 

SEQ2 811 G - TACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTTAAAGCA GGTATGT 

SEQ2 801 TTGCAGC^TATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATA 

SEQ2802 TCTTACCAACTCATCCTACTGGTCTT CTCAAAACAGGAACAATTGAT - AGGCACCA 

SEQ2803 TTGCAGC^TATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATA

SEQ2804 TTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATA 

SEQ2805 TTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATA

SEQ2806 TTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATA

SEQ2807 TTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATA

SEQ2 808 TTGCAGCATATAAAGCTATTGATTTCC^TCCTCGATACAAGGATTATCTATTATTTGATA 

SEQ2809 TTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATA

SEQ2810 TTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATA 

SEQ2811 TTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATA 

SEQ2801 AAGAGAATATCAGTAAAGAAGATAGACAAAAGATT-AAAGAACTTTCTTTGTCACAGGGA

SEQ2 802 AAAAACATTTGATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAAT 

SEQ2 803 AAGAGAATATCAGTAAAGAAGATAGACAAAAGATT - AAAGAACTTTCTTTGTCACAGGGA 

SEQ2 804 AAGAGAATATCAGTAAAGAAGATAGACAAAAGATT - AAAGAACTTTCTTTGTCACAGGGA 

SEQ2805 AAGAGAATATCAGTAAAGAAGATAGACAAAAGATT -AAAGAACTTTCTTTGTCACAGGGA 

SEQ2 806 AAGAGAATATCAGTAAAGAAGATAGACAAAAGATT - AAAGAACTTTCTTTGTCACAGGGA 

SEQ2807 AAGAGAATATCAGTAAAGAAGATAGACAAAAGATT -AAAGAACTTTCTTTGTCACAGGGA 

SEQ2808 AAGAGAATATCAGTAAAGAAGATAGACAAAAGATT-AAAGAACTTTCTTTGTCACAGGGA

SEQ2809 AAGAGAATATCAGTAAAGAAGATAGACAAAAGATT -AAAGAACTTTCTTTGTCACAGGGA 

SEQ2810 AAGAGAATATCAGTAAAGAAGATAGACAAAAGATT-AAAGAACTTTCTTTGTCACAGGGA

SEQ2811 AAGAGAATATCAGTAAAGAAGATAGACAAAAGATT -AAAGAACTTTCTTTGTCACAGGGA 

SEQ2801 TACGTTA-AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA 

SEQ2 802 TCCGTGGCAGTTGTTGAATTTTTCTGATCCA TCATCTCAAAAAATTCACGATGATTA 

SEQ2 803 TACGTTA-AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA 

SEQ2804 TACGTTA-AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA

SEQ2 805 TACGTTA- AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA 

SEQ2 806 TACGTTA- AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA 

SEQ2807 TACGTTA - AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA 

SEQ2808 TACGTTA - AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA 

SEQ2809 TACGTTA-AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA

SEQ2810 TACGTTA -AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA 

SEQ2811 TACGTTA- AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA 
SEQ2801
SEQ2 802 
SEQ2803 
SEQ2804 
SEQ2805 
SEQ2 806 
SEQ2807 
SEQ2808 
SEQ2809
SEQ2 810 
SEQ2811 

SEQ2 801 
SEQ2802 
SEQ2803 
SEQ2804 
SEQ2 805 
SEQ2806
SEQ2807 
SEQ2808 
SEQ2809 
SEQ2810 
SEQ2 811 

SEQ2801 
SEQ2 802 
SEQ2803 
SEQ2 804 
SEQ2805 
SEQ2806 
SEQ2 807 
SEQ2808
SEQ2809 
SEQ2810 
SEQ2811 

SEQ2801 
SEQ2802 
SEQ2803 
SEQ2804 
SEQ2805 
SEQ2806 
SEQ2 807 
SEQ2808 
SEQ2809 
SEQ2810 
SEQ2811 

SEQ2801 
SEQ2802 
SEQ2803 
SEQ2804 
SEQ2805 
SEQ2806 
SEQ2807 
SEQ2808 
SEQ2809 
SEQ2810 
SEQ2811 

SEQ2801
SEQ2 802 
SEQ2803 
SEQ2 804 
SEQ2805 
SEQ2806
SEQ2807 
SEQ2808 



TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 
TTTAAACATTATGGTGTGAAGGAGTTAGAAATTGA - GAGCATTGCTTTAGGATTAGGTG 
TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 
TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 
TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 
TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 
TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 
TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 
TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 
TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 
TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 

AAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTT

TAATAG CAAAGAAAACACAC TG ATAAAGATGG CAGAT TATCGTTTGAAAAATT 

AAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTT 
AAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTT
AAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTT 
AAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTT 
AAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTT 
AAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTT 
AAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTT 
AAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTT 
AAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTT 

GGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACATCCTT 

GGAGAGAC - - CCGATAC CAAAACCTTTTTAA AAGACTCCTATTATAGTATT 

GGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTT
GGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTT 
GGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCy^AGGGCGTGGAATACATCTTT 
GGAGCGACTATCAATGCATGGGAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTT 
GGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTT 
GGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGTGTGGAATACATCCTT 
GGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGTGTGGAATACATCCTT 
GGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTT 
GGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTT 

GCCACAAATAAACATAGTCAATTCCTATGGGGGGATGC^

A- -AGAAAGAA 

GCCACAAATAAACATAGTCAATTCCTATGGGGGGATGCAC^GTATTTAATCAAGGTTA 

GCCACAAATAAACATAGTCAATTCCTATGGGGGGATGCACAAGTA

GCCACAAATAAACATAGTCAATTCCTATGGGGGGATGCACA 

GCCACAAATAAACATAATCAATTCCTATGGGGGGATGCAGAAGTATTTAATCAAGGTTA 

GCCACAAATAAACATAGTCAATTCCTATGGGGGGATGCACAAGTATTTAATCAAGGTTA 

GCCACAAATAAACATAGTCAATTCCTATGGGGGGATGC^CAAGTATTTAATCAAGGTTA 

GCGACAAATAAACATAGTCAATTCCTATGGGGGGATGGACA 

GCCACAAATAAACATAGTCAATTCCTATGGGGGGATGGACAAGTATTTA 

GCCACAAATAAACATAGTCAATTCCTATGGGGGGATGCACAAGTATTTAATCAAGGTTA 

GGTTTATTAGGCTTTAAAAACGCAAAACATCATTATCAAGTTGATGGTAAAAGAGGCAA

GGTTTATTAGGCTTTAAAAACGCAAAACATCATTATC^GTTGATGGTAAAAGAGGCAA 
GGTTTATTAGGCTTTAAAAACGCAAAAC^TCATTATCAAGTTGATGGTAAAAGAGGCAA 
GGTTTATTAGGCTTTAAAAACGCAAAACATCATTATCAAGTTGATGGTAAAAGAGGCAA 
GGTTTATTAGGCTTTAAAAACGCAAAACATCATTATCAAGTTGATGGTAAAAGAGGCAA 
GGTTTATTAGGCTTTAAAAACGCAAAACATCATTATCAAGTTGATGGTAAAAGAGGCAA
GGTTTATTAGGCTTTAAAAACGCAA^ACATCATTATCAGGTTGATGGTAAAAGAGGCAA 
GGTTTATTAGGCTTTAAAAACGCAAAACATCATTATCAGGTTGATGGTAAAAGAGGCAA
GGTTTATTAGGCTTTAAAAACGCAAAACATCATTATCAAGTTGATGGTAAAAGAGGCAA 
GGTTTATTAGGCTTTAAAAACGCMAACATCATTATC^GTTGATGGTAAAAGAGGCAA 



GGAGAGTGGAAACATCCTCTG- 



GGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAG 
GGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAG
GGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAG 
GGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAG 
GGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAG 
GAAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAG 
SEQ2809
SEQ2 810 
SEQ2811 

SEQ2801 
SEQ2802 
SEQ2803 
SEQ2804 
SEQ2805 
SEQ2806 
SEQ2807 
SEQ2808 
SEQ2809
SEQ2810 
SEQ2 811 

SEQ2801 
SEQ2802 
SEQ2803 
SEQ2804 
SEQ2805 
SEQ2806 
SEQ2807 
SEQ2808 
SEQ2809 
SEQ2810 
SEQ2811 

SEQ2801
SEQ2802 
SEQ2803 
SEQ2804 
SEQ2805 
SEQ2806 
SEQ2807 
SEQ2808 
SEQ2809
SEQ2810 
SEQ2811 

SEQ2801 
SEQ2802 
SEQ2 803 
SEQ2 804 
SEQ2805 
SEQ2806 
SEQ2807 
SEQ2808
SEQ2809 
SEQ2810 
SEQ2811 

SEQ2801 
SEQ2802 
SEQ2803 
SEQ2 804 
SEQ2 805 
SEQ2 806 
SEQ2807 
SEQ2808 
SEQ2809
SEQ2810 
SEQ2 811 

SEQ2 801 
SEQ2 802 
SEQ2803 
SEQ2804 




GAAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAG 
GGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAG 
GGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAG 



GATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACG 
GATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACG 
GATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACG
GATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACG 
GATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACG 
GATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACG 
GATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACG 
GATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACG 
GATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACG 



TTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGT 
TTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGT 
TTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGT 
TTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGT 
TTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGT 
TTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGT 
TTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGT 
TTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGT 
TTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGT 



ACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATT
ACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATT 
ACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATT 
ACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATT 
ACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATT 
ACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATT 
ACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATT 
ACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATT 
ACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATT 



TTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACGGTAA 
TTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACGGTAA 
TTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACGGTAA 
TTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACGGTAA 
TTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACGGTAA 
TTTGTCCAAGAGCGCTATAACGCCTTAAAAGCGAACTATCTTCGACAGCTTAATGGTAA 
TTTGTCCAAGAGCGCTATAACGCCTTAAAAGCGAACTATCTTCGACAGCTTAATGGTAA 
TTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACGGTAA 
TTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACGGTAA 



GATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGT 
GATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGT
GATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGT
GATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGT
GATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGT 
GATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGT 
GATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGT 
GATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGT 
GATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGT 



TTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTT 
TTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTT 
TTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTT
TTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTT 
TTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTT 
TTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTT 
TTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTT 
TTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTT 
TTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTT 



CCAACTCATCCTACTGGTCTTCTCAAAACAGGAACAACTGATAG^ 
CCAACTCATCCTACTGGTCTTCTCAAAACAGGAACAACTGATAGGCACCAAA 
CCAACTCATCCTACTGGTCTTCTCAAAACAGGAACAACTGATAGGCACC^AAAACATT 
CCAACTCATCCTACTGGTCTTCTCAAAAC^ 

CCAACTCATCCTACTGGTCTTCTCAAAACAGGAACAACTGATAGGCaCCAAAA 

CCAACTCATCCTACTGGTCTTCTCAAAACAGGAACAACTGATAGGCACCA^ 

CCAACTCATCCTACTGGTCTTCTCAAAACAGGAACAAC^ 

CCAACTCATCCTACTGGTCTTCTCAAAACAGGAAC^^ 

CCAACTCATCCTACTGGTCTTCTCAAAACAGGAACAACTGAT^ 



GATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCA 
GATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCA 
GATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCA 
GATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTC^GAATTCCGTGGCA 
GATT(^CAACCAGATATTTCGTTTGGAAAGGACTTTATAGAGGTa^GAATTCCGTGGCA 
GATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCA 
GATTCAGAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCA 
GATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCA 
GATTGACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCA



TTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTA 
TTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTA 
TTGTTGAATTTTTCTGATCCATCATCTCAAAGAATTCACGATGATTACTTTAAACATTA
TTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTA 
TTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTA 
TTGTTGAATTTTTCTGATCCATCATCTGAAAAAATTCACGATGATTACTTTAAACATTA 
TTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTA 
TTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTA 
TTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTA 



GGTGTGAAGGAGT TAGAAATTGAGAG - - CATTGCTTTAGGATTAGGTGCTAATAGCAAA 
GGTGTGAAGGAGT TAGAAATTGAGAG - - CATTGCTTTAGGATTAGGTGCTAATAGCAAA 
GGTGTGAAGGAGTTAGAAAATTGAGAGCCATTGCTTTAGGATTAGGTGCTAATAGCAAA 
GGTGTGAAGGAGTTAGAAATTGAGAG - - CATTGCTTTAGGATTAGGTGCTAATAGCAAA 
GGTGTGAAGGAGTTAGAAATTGAGAG- - CATTGCTTTAGGATTAGGTGCTAATAGCAAA 
GGTGTGAAGGAGTTAGAAATTGAGAG - - CATTGCTTTAGGATTAGGTGCTAATAGCAAA 
GGTGTGAAGGAGTTAGAAATTGAGAG - - CATTGCTTTAGGATTAGGTGCTAATAGCAAA 
GGTGTGAAGGAGTTAGAAATTGAGAG - - CATTGCTTTAGGATTAGGTGCTAATAGCAAA 
GGTGTGAAGGAGTTAGAAATTGAGAG - - CATTGCTTTAGGATTAGGTGCTAATAGCAAA 



AAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACC 
AAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACC 
AAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACC 
AAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACC 
AAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACC 
AAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACC 
AAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACC 
AAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACC 
AAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACC 

SEQ2801 

SEQ2802 

SEQ2 803 AAACCTTTTTAAAAGACTCCTATTATGTATTAAGAAAGAA 

SEQ2804 AAACCTTTTTAAAAGACTCCTATTATAGTATTAAGAAAGAATGGTCTAAAGAAAGAGAG 

SEQ2805 AAACCTTTTTAAAAGA

SEQ2806 AAACCTTTTTAAAAGACTCCTATTATGTATTAAGAAAGA

SEQ2807 AAACCTTTTTAAAAGACT — 

SEQ2808 AAACCTTTTTAAAAGACTCCTATTATAGT • 

SEQ2 809 AAACCTTTTTAAAAGACTCCTATTATAGTATTAAGAAAG 

SEQ2 810 AAACCTTTTTAAAAGACTCCTATTATAGTATTAAG - - 

SEQ2811 AAACCTTTTTAAAAGACTCCTATTATAGTATTAAGAAAGAATGG 



SEQ2801 

SEQ2802 

SEQ2803 

SEQ2804 GAACATATGGTCCA 

SEQ2805

SEQ2806 

SEQ2807 

SEQ2808 

SEQ2809 

SEQ2810 

SEQ2811 

>SEQ ID NO 2850:62_1169NT frame: 1 

FWKGDTVLHKPTOKPFWKGVDVESSLAGYHHN^ 
KVPMNVAFYDALYHHNKASKKPLYLLQ 

ILHGRKQVWm , DFGSRHYHYDLSPWVLGYWGDDWNSGTVAyTNHQEKKTQYKGRYFKTS 
AAANPFEVMIiAQVMDELTHYETAKYGVJQHI.ISFSNSPTTDPFRYRKPFEAQAPKYVQLNV 
ENI QANSNVKAGI FAA YKAIDFHPRYKD YLLFDKENI S KEDRQKI KELSL S QG YVKLLNA 
YHKI PVLVTGYGYS TARGI AQKE I DKRPLP INEKEQGQRLLED YES F I S SGS FGAT INAW 
QDDWNARAVmTSFATNKHSQFLWGDAQVFNQGYGLLGFKNAKHHYQVDGKRGKGEWKHPL 
MTSATGDDLYASSDESYLYIiAIKTKPEKLKEKRLLPIDITPKSGSRKMNGSKVTFSKSSD 
FVLSIDPNGKSELFVQERYNAIiKANYLRQLNGKDFYAFPPKKNSSNFEQINIWLRNTKIV 
EDMEKVKATERFLPTHPTGLLKTGTIDRHQKTFDSQTDISFGKDFIEVRIPWQLLNFSDP 
S SQKI HDD YFKHYGVKELEI E S I ALGIjGANSKENTIiI KMAD YRLKNWERPDTKT FLKDS Y 
YSI.ER 

>SEQ ID NO 2851:62_18RS21 frame: 1

KGLLKEl^RTNFWKGDTVLHKPmiCPFWKGVDVESSLAGYHHNDFPITQKTYREWFHL 
I SNMGANTVRVKVPMNVAF YDAL YHHNKAS KRPL YLLQGIRIDS YRNNAS I TAFNDNYRG 
YLK^EAKGVAmiLHGRKQVWNTDLGSRHYHYDLSP^WLGYWGDDWNSGTVAYTNHQEKK 
TQYKGRYFKTSVAANPFE^lLtAQWlDELTHYETAKYGWQHLISFSNSPTTDPFHYRKPFE 
AQAPKYVQLNVENI QANSNVKAGMFAA YKAI DFHPRYKD YLLFDKENI S KEDRQKI KELS 
LSQGYVKLLNAYHKIPVIiVTGYGYSTARGIAQKEIDKRPLPINEKEQGQRLLEDYESFIS 
SGS FGATINAWQDDVJNARAWNT S FATNKHS QFXjWGDAQVFNQGYGLLGFKNAKHHYQVDG 
KRGKGEWKHPLMTSATGDDLYASSDESYLYIiAIKTKPEKLKEKRLLPIDITPKSGSRKMN 
GSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYIiRQLNGKDFYAFPPKKNSSNFEQ 

INMVLRNTKIVEDMEK^KATERFLPTHPTGLLKTGTTDRHQKTFDSQTDISFGKDFIEVR
IPWQLLNFSDPSSQKIHDDYFKHYGVKELEIESIALGLGANSKENTLIKMADYRLKNWER
PDTKTFLKDSYYVLRK

>SEQ ID NO 2852:62_2603 frame: 3

LKENTRTNFVVKGDTVIjHKPTNKPFWKGVDVESSIjAGYHHNDFP I TQKT YREWFHLISN 
MGANTVRVKVPMNVAFYDALYIIHNK&SKRPL^ 

RE^KGVVDILHGRKQVWNTDLGSRHYHYDLSPWVLGYWGDDWNSGTVAYTNHQEKKTQY 
KGR YFKTS VAANPFEVMLAQVMDELTH YETAKYGWQHIj I S FSNSPTTDPFHYRKPFEAQA 

PKYVQLNVENIQANSNVKAGMFAAYKAIDFHPRYKDYLLFDKENISKEDRQKIKELSLSQ
GYVKLLNAYHKIPVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQRLLEDYESFISSGS

FGATINAWQDDWNARAWTSFATNKHSQFLWGDAQVFNQGYGLLGFKNAKHHYQVDGKRG 
KGEWKHPLMTSATGDDLYASSDESYIiYLAIKTKPEKLKEKRLLPIDITPKSGSRKMKGSK 
VTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFPPKKNSSNFEQINM 
VLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI S FGKDFI EVRI PW 
QLLNFSDPSSQKIHDDYFKHYGVKELEIESIALGLGANSKENTLIKMADYRLKNWERPDT 
KTFLKDSYYSIKKEWSKERERTYGP 
>SEQ ID NO 2853:62_A909 frame: 1

KGLLKENTRTNFWKGDTVLHKPTNKPFVVKGVDVESSIiAGYHHNDFPITQKTYREWFHL 
ISNMGAimmVKVPMNVAFYDALYHHNKASKRPLYLLQGIRIDSYRimASITAFNDNYRG 
YLKREAKGWD I LHGRKQVWNTDLGSRHYH YDL S PWVLG YVVGDDWNSGTVAYTNHQEKK 
TQ YKGR YFKTS VAANP FEVMIoAQVMDELTHYETAKYGWQHL I S FSNS PTTDPFHYRKP FE 
AQAPKYVQLNVENI QANSNVKAGMFAAYKAIDFHPRYKDYLLFDKENI SKEDRQKI KELS 
LSQGYVKLLNAYHKI PVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQRLLEDYES FI S 
SGS FGAT INAWQDDWNARAWNTS FATNKHS QFLWGDAQVFNQG YGLLGF KNAKHH YQVDG 
KRGKGEWKHPLMTSATGDDLYASSDESYIiYLAIKTKPEKLKEKRLLPIDITPKSGSRKMN 
GSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFPPKKNSSNFEQ 
INMVLRNTKI VEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDS QTD I S FGKDF I EVR 
I PWQLLNFSDP S S QRI HDD YFKH YGVKELEN . EPLL . D . VLIAKKTH . . RWQIIV . KIGR 
DPIPKPF.K 

>SEQ ID NO 2854:62_A909 frame: 1 

KGLLKENTRTNFWKGDTVLHKPTNKPFVVKGVDVES SLAGYHHNDFP I TQKTYREWFHL 
ISNMGANTVRVKVPMNVAFYDALYHHNKASKRPLYLLQGIRIDSYRNNASITAFNDNYRG 
YLKREAKGWD I LHGRKQVWNTDLGS RH YH YDLS PWVLGYWGDDWNS GTVAYTNHQEKK 
TQ YKGR YFKTS VAANP FEVMLAQVMDELTH YE TAKYGWQHLI S FSNSPTTDPFHYRKPFE 
AQAPKYVQLNVENIQANSNVKAGMFAAYKAIDFHPRYKDYLLFDKENI SKEDRQKI KELS 
LSQGYVKLLNAYHKIPVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQRLLEDYESFIS 
SGSFGATINAWQDDWNARAWNTSFATNmSQFLWGDAQVFNQGYGLLGFKNAKHHYQVDG 
KRGKGEWKHPLMTSATGDDLYASSDESYLYLAIKTKPEKLKEKRLLPIDITPKSGSRKMN 
GSKVTF SKS SDFVL S IDPNGKS EL FVQER YNALKANYLRQLNGKDF YAFPPKKNS SNFEQ 
INMVLRNTKIVEDMEICVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDISFGKDFIEVR 
I PWQLLNFSD P S S QRI HDD YFKHYGVKELEN . EPLL . D . VLIAKKTH . . RWQIIV . KIGR 
DPIPKPF . K 

>SEQ ID NO 2855:62_CJB 110 frame: 1

YYFDGS L YLP KGLLKENTRTNFVVKGDTVLHKPTNKPF WKGVDVE S S LAG YHHNDFP I T 
QKTYREWFHLISNMGANWRVKVPMNVAFYDALYHHNKASKRPLYLLQGIRIDSYRNNAS 
ITAFNDNYRGYLKREAKGWD I LHGRKQVWNTDFGSRH YH YDL S PWVLGYWGDDWNS GT 
VAYTNHQEKKTQYKGRYFKTSVAANPFEVMLAQVMDELTHYETAKYGWQHLISFSNSPTT 
DPFHYT^KP FEAQAPKYVQLNVENI QANSNVKAGMFAAYKA I DFHPR YKD YLL FDKENI S K 
EDRQKI KELSLSQGYVKLLNAYHKI PVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQR 
LLED YE S F I S S GS FGAT INAWQDDWNARAWNTS FATNKHNQFLWGDAQVFNQGYGLLGFK 
NAKHHYQVDGKRGKGEWKHPLMTSATGDDLYASSDESYLYLAIKTKPEKLKEKRLLPIDI 
TPKSGSRKMNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFP 
PKKNSSNFEQItJNVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI 
SFGKDFI EVRI PWQLLNFSDPS SQKIHDDYFKHYGVKELE I ES I ALGLGANSKENTLI KM 
ADYRLKNWERPDTKTFLKDSYYVLRK 

>SEQ ID NO 2856:62_COH1 frame: 2

LPQGLLKENTRTNFWKGDTVLHKPTNKPFWKGVDVESSLAGYHHNDFPITQKTYREWF 
HLI SNMGANTVRVKVPMNVAFYDALYHHNKES KRPLYLLQGIRIDS YRNNAS ITAFNDNY 
RGYLKREAKGWDI LHGRKQVWNTDFGSRH YH YDL S PWVLGYWGDDWNS GTVA YTNHQE 
KKTQYKGRYFKTSVAANPFEVMLAQVMDELTHYETAKYGWQHLISFSNSPTTDPFHYRKP 
FEAQAP KYVQLNVENI QANSNVKAGMFAAYKAIDFHPR YKDYLLFDKENI S KEDRQKI KE 
LSLSQGYVKLLNAYHKIPVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQRLLEDYESF 
IS S GS F GAT INAWQDDWNARAWNT S FATNKHS QFLWGDAQVFNQG YGLLGFKNAKHHYQV 
DGKRGKGEWKHPLMTSATGDDLYASSDESYLYLAIKTKPEKLKEKRLLPIDITPKSGSRK 
MNGSKVTFSKSSDFVLS IDPNGKS ELFVQERYTSTALKANYLRQLNGKDFYAFPPKKNSSNF 
EQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQPDISFGKDFIE 
VRI PWQLLNFSDPS SQKIHDDYFKHYGVKELE I ES IALGLGANSKENTLI KMADYRLKNW 
ERPDTKTFLKD 

>SEQ ID NO 2857:62_H36B frame: 2 

RGLLKENTRTNFWKGDTVLHKPTNKPFWKGVDVESSLAGYHHNDFPITQKTYREWFHL 
ISNMGANTVRVKVPMNVAFYDALYHHNKASKRPLYLLQGIRIDSYRNNASITAFNDNYRG 
YLICRFAKGVVDILHGRKQVWNTDFGSSHYHYDLSPWVLGYWGDDGHSGTVALY 

>SEQ ID NO 2858:62_JM9130013 frame: 3 

FWKGDTVLHKPTNKPFWKGVDVES SLAG YHHNDFP I TQKTYREWFHL I SNMGANTVRV 
KVPMNVAFYDALYHHNKASKRPLYLLQGIRIDSYRNNASITAFNDNYRGYLKREAKGWD 
ILHGRKQVWNTDFGSSHYHYDLSPWVLGYWGDDWNSGTVAYTNHQEKKTQYKGRYFKTS 
VAANP FEVMLAQVMDELTHYETAKYGWQHL IS FSNS PTTDPFHYRKPFEAQAPKYVQLNV 
ENIQANSNVKAGMFAAYKAIDFHPRYKDYLLFDKENISKEDRQKIKELSLSQGYVKLLNA 
YHKIPVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQRLLEDYESFISSGSFGATINAW 
QDDWNARVWNTSFATNKHSQFLWGDAQVFNQGYGLLGFKNAKHHYQVDGKRGKEEWKHPL
MTSATGDDLYASSDESYLYLAIKTKPEKLKEKRLLPIDITPKSGSRKMNGSKVTFSKSSD 
FVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFPPKKNSSNFEQINMVLRNTKIV 
EDMEKVKATERFLP THP TGIiLKTGTTDRHQKTFDSQTD I S FGKDF I EVRI PWQLLNFSD P 
S SQKIHDD YFKHYGVKELE I ES I ALGLGANS KENTLI KMAD YRLKNWERPDTKTFL KDS Y 
YSIKK 

>SEQ ID NO 2859:62_M732 frame: 2 

TRTNFVVKGDTVLHKP TNKP FVVKGVD VE S S LAG YHHNDF P I TQKTYREWFHLI SNMGAN 
TVRVKVPMNVAFYDALYHHNKESKRPLYLLQGIRIDSYRNNASITAFNDNYRGYLKREAK 
GWD I LHGRKQVWNTDFGS RHYHYDL S PWVLGYVVGDDCNSGTVAYTNHQEKKTQ YKGR Y 
FKTSVAANPFEVlvn^QVMDEIiTHYETAKYGWQHLISFSNSPTTDPFHYRKPFFAQAPKYV 
QLNVENIQANSNVKAGMFAAYKAIDFHPRYKDYIiLFDKENISKEDRQKIKELSLSQGYVK 
LLNAYHKI PVIiVTGYGYSTARGIAQKEIDKRPLPINEKEQGQRLIiEDYESF I SSGS FGAT 
INAWQDDWNARAWNTS FATNKHS Q FLWGDAQVFNQGYGLLGFKNAKHH YQVDGKRGKGEW 
KHPLMTSATGDDLYASSDESYLYIiAIKTKPEKLKEKRLLPIDITPKSGSRKMNGSKVTFS 
KS SDFVLS IDPNGKS EL FVQER YNALKANYLRQLNGKDF YAFP PKKNS SNFEQ I NMVLRN 
TKI VEDMEKVKATERFLPTHPTGIiXiKTGTTDRHQKTFDSQTDI SFGKDFI EVRI PWQLLN 
FSDPSSQKIHDDYFKHYGVKELEI ESI ALGLGANS KENTLI KMAD YRLKNWERPDTKTFL 
KDSYYSIK 

>SEQ ID NO 2860:62_M781 frame: 1 

FDGSLYLPQGLLKENTRTNFWKGDTVLHKPTNKPFVVKGVDVESSLAGYHHNDFPITQK 
TYREWFHLISNMGANTVRVKVPMNVAFYDALYHHNKESKRPLYLLQGIRIDSYRNNAS 
AFNDNYRG YLKREAKGVVD I LHGRKQVWNTDFGS RHYH YDLS P WVLGYWGDDWNS GTVA 
YTNHQEKKTQYKGRYFKTS VAANP FEVMLAQVMDELTHYETAKYGWQHL I S FSNS PTTD P 
FHYRKP FEAQAPKYVQLNVENI QANSNVKAGMFAAYKAIDFHPRYKD YLL FDKENI SKED 
RQKI KELSLSQGYVKLLNAYHKI PVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQRLL 
EDYE S F I S S GS FGATINAWQDDWNARAWNTS FATNKHS QFLWGDAQVFNQGYGLLGFKNA 
KHHYQVDGKRGKGEWKHPLMTSATGDDL YAS S DES YL YLAI KTKPEKLKEKRLL P I DI TP 
KSGSRKMNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFPPK 
KNS SNFEQ INMVLRNTKI VEDMEKVKATERFLPTHPTGLLKTGTTDRHQKT FDS QTD I S F 
GKD F I EVRI PWQLLNF SD PS SQKI HDD YFKHYGVKELE I E S I ALGLGANS KENTLI KMAD 
YRL KNWERPDTKTFLKDS Y YS I KKEW 



SEQ2850 FVVKGDTVLHKPTNKP FVVKGVD VE S SLAGYHHNDFP I T 

SEQ2851 KGLL KF^TRTNFWKGDTVLHKP TNKP FVVKGVD VES SLAG YHHNDF PIT 

SEQ2 852 LKENTRTNFVVKGDTVLHKPTNKPFVVKGVDVES SIoAGYHHNDFP I T 

SEQ2 853 KGLLKENTRTNFVVKGDTVLHKPTNKPFVVKGVDVESSLAGYHHNDFPIT 

SEQ2 854 KGLLKENTRTN FWKGD TVLH KP TNKP F WKGVD VE S S LAG YHHND F P I T 

SEQ2855 YFDGSLYLPKGLLKENTRTNFWKGDTVLHKPTNKPFWKGVDVESSLAGYHHNDFPIT 

SEQ2 856 LPQGLLKENTRTNFVVKGDTVLHKPTNKPFVVKGVDVESSLAGYHHNDFPIT 

SEQ2 857 RGLLKENTRTNFVVKGDTVLHKPTNKP FWKGVDVES SLAGYHHNDF P I T 

SEQ2 858 FVVKGDTVLHKPTNKPFVVKGVDVES SLAG YHHNDF PIT 

SEQ2 859 TRTNFWKGDTVLHKPTNKP FWKGVDVES S LAG YHHNDFP I T 

SEQ2 860 - FDGSL YLPQGLLKENTRTNFVVKGDTVLHKP TNKP FWKGVDVES S LAG YHHNDFP I T 

SEQ2850 QKTYREWFHLISNMGANTVRVKVPMlWAFYDALYHHNICASKRPLY r LLQGIRIDSYRNNAS 

SEQ2851 QKTYREWFHLISNMGANTVRVKVPMNVAFYDALYHHNKASKRPLYLLQGIRIDSYRNNAS 

SEQ2852 QKTYREWFHLISNMGANTWVKVPMNVAFYDALYHHNKASKRPLYLLQGIRIDSYRNNAS 

SEQ2 853 QKTYREWFHLI SNMGANTVRVKVPMNVAF YDALYHHNKASKRPLYLLQGIRIDS YRNNAS 

SEQ2854 QKTYREWFHLI SNMGANTVRVKVPMNVAF YDALYHHNKAS KRPLYLLQGIRIDS YRNNAS 

SEQ2855 QKTYREWFHLI SNMGANTVRVKVPMNVAF YDALYHHNKASKRPLYLLQGIRIDS YRNNAS 

SEQ2 856 QKTYREWFHLI SNMGANTVRVKVPMNVAF YDALYHHNKES KRPLYLLQGIRIDS YRNNAS 

SEQ2857 QKTYREWFHLI SNMGANTVRVKVPMNVAF YDALYHHNKASKRPLYLLQGIRIDS YRNNAS 

SEQ2 858 QKTYREWFHLISNMGANTTOVKVPMNVAFYDALYHHNKASKRPLYLLQGIRIDS YRNNAS 

SEQ2 859 QKTYREWFHLI SNMGANTVRVKVPMNVAF YDALYHHNKESKRPLYLLQGIRIDS YRNNAS 

SEQ2860 QKTYREWFHLISNMGANTVRVKVPMNVAFYDALYHHNKESKRPLYLLQGIRIDS YRNNAS 

SEQ2850 ITAFNDNYRGYLKREAKGVVDILHGRKQVWNTDFGSRHYHYDLSPWVLGYVVGDDWNSGT
SEQ2851 ITAFNDNYRGYLKREAKGVVDILHGRKQVWNTDLGSRHYHYDLSPWVLGYVVGDDWNSGT
SEQ2852 ITAFNDNYRGYLKREAKGVVDILHGRKQVWNTDLGSRHYHYDLSPWVLGYVVGDDWNSGT
SEQ2853 ITAFNDNYRGYLKREAKGVVDILHGRKQVWNTDLGSRHYHYDLSPWVLGYVVGDDWNSGT
SEQ2854 ITAFNDNYRGYLKREAKGVVDILHGRKQVWNTDLGSRHYHYDLSPWVLGYVVGDDWNSGT
SEQ2855 ITAFNDNYRGYLKREAKGVVDILHGRKQVWNTDFGSRHYHYDLSPWVLGYVVGDDWNSGT
SEQ2856 ITAFNDNYRGYLKREAKGVVDILHGRKQVWNTDFGSRHYHYDLSPWVLGYVVGDDWNSGT
SEQ2857 ITAFNDNYRGYLKREAKGVVDILHGRKQVWNTDFGSSHYHYDLSPWVLGYVVGDDGHSGT
SEQ2858 ITAFNDNYRGYLKREAKGVVDILHGRKQVWNTDFGSSHYHYDLSPWVLGYVVGDDWNSGT
Table 28: Comparative Sequences relating to SAG1552
(conserved hypothetical protein) 



SEQ2859 ITAFNDNYRGYLKREAKGVVDILHGRKQVWNTDFGSRHYHYDLSPWVLGYVVGDDCNSGT
SEQ2860 ITAFNDNYRGYLKREAKGVVDILHGRKQVWNTDFGSRHYHYDLSPWVLGYVVGDDWNSGT

SEQ2850 VAYTNHQEKKTQYKGRYFKTSAAANPFEVMLAQVMDELTHYETAKYGWQHLISFSNSPTT
SEQ2851 VAYTNHQEKKTQYKGRYFKTSVAANPFEVMLAQVMDELTHYETAKYGWQHLISFSNSPTT
SEQ2852 VAYTNHQEKKTQYKGRYFKTSVAANPFEVMLAQVMDELTHYETAKYGWQHLISFSNSPTT
SEQ2853 VAYTNHQEKKTQYKGRYFKTSVAANPFEVMLAQVMDELTHYETAKYGWQHLISFSNSPTT
SEQ2854 VAYTNHQEKKTQYKGRYFKTSVAANPFEVMLAQVMDELTHYETAKYGWQHLISFSNSPTT
SEQ2855 VAYTNHQEKKTQYKGRYFKTSVAANPFEVMLAQVMDELTHYETAKYGWQHLISFSNSPTT
SEQ2856 VAYTNHQEKKTQYKGRYFKTSVAANPFEVMLAQVMDELTHYETAKYGWQHLISFSNSPTT
SEQ2857 VALY
SEQ2858 VAYTNHQEKKTQYKGRYFKTSVAANPFEVMLAQVMDELTHYETAKYGWQHLISFSNSPTT
SEQ2859 VAYTNHQEKKTQYKGRYFKTSVAANPFEVMLAQVMDELTHYETAKYGWQHLISFSNSPTT
SEQ2860 VAYTNHQEKKTQYKGRYFKTSVAANPFEVMLAQVMDELTHYETAKYGWQHLISFSNSPTT

SEQ2850 PFRYRKPFEAQAPKYVQLNVENIQANSNVKAGIFAAYKAIDFHPRYKDYLLFDKENISK
SEQ2851 PFHYRKPFEAQAPKYVQLNVENIQANSNVKAGMFAAYKAIDFHPRYKDYLLFDKENISK
SEQ2852 PFHYRKPFEAQAPKYVQLNVENIQANSNVKAGMFAAYKAIDFHPRYKDYLLFDKENISK
SEQ2853 PFHYRKPFEAQAPKYVQLNVENIQANSNVKAGMFAAYKAIDFHPRYKDYLLFDKENISK
SEQ2854 PFHYRKPFEAQAPKYVQLNVENIQANSNVKAGMFAAYKAIDFHPRYKDYLLFDKENISK
SEQ2855 PFHYRKPFEAQAPKYVQLNVENIQANSNVKAGMFAAYKAIDFHPRYKDYLLFDKENISK
SEQ2856 PFHYRKPFEAQAPKYVQLNVENIQANSNVKAGMFAAYKAIDFHPRYKDYLLFDKENISK
SEQ2857
SEQ2858 PFHYRKPFEAQAPKYVQLNVENIQANSNVKAGMFAAYKAIDFHPRYKDYLLFDKENISK
SEQ2859 PFHYRKPFEAQAPKYVQLNVENIQANSNVKAGMFAAYKAIDFHPRYKDYLLFDKENISK
SEQ2860 PFHYRKPFEAQAPKYVQLNVENIQANSNVKAGMFAAYKAIDFHPRYKDYLLFDKENISK

SEQ2850 DRQKIKELSLSQGYVKLLNAYHKIPVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQR
SEQ2851 DRQKIKELSLSQGYVKLLNAYHKIPVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQR
SEQ2852 DRQKIKELSLSQGYVKLLNAYHKIPVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQR
SEQ2853 DRQKIKELSLSQGYVKLLNAYHKIPVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQR
SEQ2854 DRQKIKELSLSQGYVKLLNAYHKIPVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQR
SEQ2855 DRQKIKELSLSQGYVKLLNAYHKIPVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQR
SEQ2856 DRQKIKELSLSQGYVKLLNAYHKIPVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQR
SEQ2857
SEQ2858 DRQKIKELSLSQGYVKLLNAYHKIPVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQR
SEQ2859 DRQKIKELSLSQGYVKLLNAYHKIPVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQR
SEQ2860 DRQKIKELSLSQGYVKLLNAYHKIPVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQR

SEQ2850 LEDYESFISSGSFGATINAWQDDWNARAWNTSFATNKHSQFLWGDAQVFNQGYGLLGFK
SEQ2851 LEDYESFISSGSFGATINAWQDDWNARAWNTSFATNKHSQFLWGDAQVFNQGYGLLGFK
SEQ2852 LEDYESFISSGSFGATINAWQDDWNARAWNTSFATNKHSQFLWGDAQVFNQGYGLLGFK
SEQ2853 LEDYESFISSGSFGATINAWQDDWNARAWNTSFATNKHSQFLWGDAQVFNQGYGLLGFK
SEQ2854 LEDYESFISSGSFGATINAWQDDWNARAWNTSFATNKHSQFLWGDAQVFNQGYGLLGFK
SEQ2855 LEDYESFISSGSFGATINAWQDDWNARAWNTSFATNKHNQFLWGDAQVFNQGYGLLGFK
SEQ2856 LEDYESFISSGSFGATINAWQDDWNARAWNTSFATNKHSQFLWGDAQVFNQGYGLLGFK
SEQ2857
SEQ2858 LEDYESFISSGSFGATINAWQDDWNARVWNTSFATNKHSQFLWGDAQVFNQGYGLLGFK
SEQ2859 LEDYESFISSGSFGATINAWQDDWNARAWNTSFATNKHSQFLWGDAQVFNQGYGLLGFK
SEQ2860 LEDYESFISSGSFGATINAWQDDWNARAWNTSFATNKHSQFLWGDAQVFNQGYGLLGFK

SEQ2850 AKHHYQVDGKRGKGEWKHPLMTSATGDDLYASSDESYLYLAIKTKPEKLKEKRLLPIDI
SEQ2851 AKHHYQVDGKRGKGEWKHPLMTSATGDDLYASSDESYLYLAIKTKPEKLKEKRLLPIDI
SEQ2852 AKHHYQVDGKRGKGEWKHPLMTSATGDDLYASSDESYLYLAIKTKPEKLKEKRLLPIDI
SEQ2853 AKHHYQVDGKRGKGEWKHPLMTSATGDDLYASSDESYLYLAIKTKPEKLKEKRLLPIDI
SEQ2854 AKHHYQVDGKRGKGEWKHPLMTSATGDDLYASSDESYLYLAIKTKPEKLKEKRLLPIDI
SEQ2855 AKHHYQVDGKRGKGEWKHPLMTSATGDDLYASSDESYLYLAIKTKPEKLKEKRLLPIDI
SEQ2856 AKHHYQVDGKRGKGEWKHPLMTSATGDDLYASSDESYLYLAIKTKPEKLKEKRLLPIDI
SEQ2857
SEQ2858 AKHHYQVDGKRGKEEWKHPLMTSATGDDLYASSDESYLYLAIKTKPEKLKEKRLLPIDI
SEQ2859 AKHHYQVDGKRGKGEWKHPLMTSATGDDLYASSDESYLYLAIKTKPEKLKEKRLLPIDI
SEQ2860 AKHHYQVDGKRGKGEWKHPLMTSATGDDLYASSDESYLYLAIKTKPEKLKEKRLLPIDI

SEQ2850 PKSGSRKMNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFP
SEQ2851 PKSGSRKMNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFP
SEQ2852 PKSGSRKMNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFP
SEQ2853 PKSGSRKMNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFP
SEQ2854 PKSGSRKMNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFP
SEQ2855 PKSGSRKMNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFP






WO 2004/018646 



PCT/US2003/026827 






SEQ2856 PKSGSRKMNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFP
SEQ2857
SEQ2858 PKSGSRKMNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFP
SEQ2859 PKSGSRKMNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFP
SEQ2860 PKSGSRKMNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFP

SEQ2850 KKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTIDRHQKTFDSQTDI
SEQ2851 KKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI
SEQ2852 KKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI
SEQ2853 KKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI
SEQ2854 KKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI
SEQ2855 KKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI
SEQ2856 KKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQPDI
SEQ2857
SEQ2858 KKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI
SEQ2859 KKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI
SEQ2860 KKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI

SEQ2850 FGKDFIEVRIPWQLLNFSDPSSQKIHDDYFKHYGVKELEIESIALGLGANSKENTLIKM
SEQ2851 FGKDFIEVRIPWQLLNFSDPSSQKIHDDYFKHYGVKELEIESIALGLGANSKENTLIKM
SEQ2852 FGKDFIEVRIPWQLLNFSDPSSQKIHDDYFKHYGVKELEIESIALGLGANSKENTLIKM
SEQ2853 FGKDFIEVRIPWQLLNFSDPSSQRIHDDYFKHYGVKELENEPLLDVLIAKKTHRWQIIV
SEQ2854 FGKDFIEVRIPWQLLNFSDPSSQRIHDDYFKHYGVKELENEPLLDVLIAKKTHRWQIIV
SEQ2855 FGKDFIEVRIPWQLLNFSDPSSQKIHDDYFKHYGVKELEIESIALGLGANSKENTLIKM
SEQ2856 FGKDFIEVRIPWQLLNFSDPSSQKIHDDYFKHYGVKELEIESIALGLGANSKENTLIKM
SEQ2857
SEQ2858 FGKDFIEVRIPWQLLNFSDPSSQKIHDDYFKHYGVKELEIESIALGLGANSKENTLIKM
SEQ2859 FGKDFIEVRIPWQLLNFSDPSSQKIHDDYFKHYGVKELEIESIALGLGANSKENTLIKM
SEQ2860 FGKDFIEVRIPWQLLNFSDPSSQKIHDDYFKHYGVKELEIESIALGLGANSKENTLIKM

SEQ2850 DYRLKNWERPDTKTFLKDSYYSIER
SEQ2851 DYRLKNWERPDTKTFLKDSYYVLRK
SEQ2852 DYRLKNWERPDTKTFLKDSYYSIKKEWSKERERTYGP
SEQ2853 IGRDPIPKPFK
SEQ2854 IGRDPIPKPFK
SEQ2855 DYRLKNWERPDTKTFLKDSYYVLRK-
SEQ2856 DYRLKNWERPDTKTFLKD
SEQ2857
SEQ2858 DYRLKNWERPDTKTFLKDSYYSIKK-
SEQ2859 DYRLKNWERPDTKTFLKDSYYSIK
SEQ2860 DYRLKNWERPDTKTFLKDSYYSIKKEW
Table 29: Comparative Sequences relating to SAG1641 (YaeC family protein)

SEQ ID NO. 2901: SAG1641 FROM THE 090 GBS TYPE Ia STRAIN

AATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAA 
GCACGTTGGGATAAAATTGAAAAGCTAGTAGGCGATAAAGCTAAAATCAAATTCACAGAATTTACAGATTATACACAA 
CCAAATCAAGCGACAGCCAATAAGGATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAG
GAAAATAAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCCCCAATTCGTATCTATTCTGAGAAGGTAAAATCT 
CTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCAACAAATGGTAGCCGTGCATTGTATGTCCTT 
CAGTCAGCAGGTTTAATCAAATTGAATGTTTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAA 
GATATTAATATTCAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAAT 
ACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAATCAGATAAAAATTCAAAACAATGG 
ATTAATATCATTGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTAT 
CACACAGATGAAGTGAAAAAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGGAACCCAGCTTTCTTGTACAA

SEQ ID NO. 2902: SAG1641 FROM THE 1169NT1 GBS TYPE V STRAIN 
(REVERSE COMPLEMENT) 

ATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAAG 
CACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCTAAAATCAAATTTACAGAATTTACAGATTATACACAAC
CAAATCAAGCGACAGCCAATAAGGATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGG 
AAAATAAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAGAAGGTAAAATCTC
TTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCAACAAATGGTAGCCGTGCATTGTATGTCCTTC
AGTCAGCAGGTTTAATCAAATTGAATGTTTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGG 
ATATTAATATTCAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAATA 
CATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAATCAGATAAAAATTCAAAACAATGGA
TTAATATCATTGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATC 
ACACAGATGAAGTGAAAAAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGG

SEQ ID NO. 2903: SAG1641 FROM THE 18RS21 GBS TYPE II STRAIN 

AATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAA
GCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCTAAAATCAAATTTACAGAATTTACAGATTATACACAA 
CCAAATCAAGCGACAGCCAATAAGGATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAG
GAAAATAAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAGAAGGTAAAATCT 
CTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCAACAAATGGTAGCCGTGCATTGTATGTCCTT 
CAGTCAGCAGGTTTAATCAAATTGAATGTTTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAG 
GATATTAATATTCAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAAT 
ACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAATCAGATAAAAATTCAAAACAATGG 
ATTAATATCATTGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTAT 
CACACAGATGAAGTGAAAAAAGTTATCAAAGATACTTCAGCTGATATTCCAC

SEQ ID NO. 2904: SAG1641 FROM THE 2603 V/R GBS TYPE V STRAIN 

AATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAA 
GCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCTAAAATCAAATTTACAGAATTTACAGATTATACACAA
CCAAATCAAGCGACAGCCAATAAGGATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAG 
GAAAATAAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAGAAGGTAAAATCT 
CTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCAACAAATGGTAGCCGTGCATTGTATGTCCTT
CAGTCAGCAGGTTTAATCAAATTGAATGTTTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAG
GATATTAATATTCAGGAGTTAGATGCCAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAAT 
ACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAATCAGATAAAAATTCAAAACAATGG
ATTAATATCATTGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTAT 
CACACAGATGAAGTGAAAAAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGG

SEQ ID NO. 2905: SAG1641 FROM THE A909 GBS TYPE Ia STRAIN

AATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAA 
GCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCTAAAATCAAATTTACAGAATTTACAGATTATACACAA
CCAAATCAAGCGACAGCCAATAAGGATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAG
GAAAATAAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAGAAGGTAAAATCT
CTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCAACAAATGGTAGCCGTGCATTGTATGTCCTT 
CAGTCAGCAGGTTTAATCAAATTGAATGTTTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAG 
GATATTAATATTCAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAAT 
ACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAATCAGATAAAAATTCAAAACAATGG
ATTAATATCATTGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTAT 
CACACAGATGAAGTGAAAAAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGG

SEQ ID NO. 2906: SAG1641 FROM THE CJB110 GBS NONTYPEABLE STRAIN

AAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGT
AGGCGATAAAGCTAAAATCAAATTCACAGAATTTACAGATTATACACAACCAAATCAAGCGACAGCCAATAAGGATGT
GGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAATAAGAAAAACTTAATTCCACTTGA
AAAGACTTACTTAGCCCCAATTCGTATCTATTCTGAGAAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTAT
TGCAATTCCAAATGATGCAACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGT
TTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAAGATATTAATATTCAGGAGTTAGATGCGAG
TCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAATACATACATTGAGCAAGCTAATTTAAAACC
TTCAGATGCTATCTTTGTTGAGAAATCAGATAAAAATTCAAAACAATGGATTAATATCATTGCGGGACGTAAAAATTG
GAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGAAGTGAAAAAAGTTATCAA
AGATACTTCAGCTGATATTCCACAATGGAA

SEQ ID NO. 2907: SAG1641 FROM THE COH1 GBS TYPE III STRAIN
(REVERSE COMPLEMENT) 

AGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAAGCACGTTG 
GGATAAAATTGAAAAGCTAGTAGGTGATAAAGCTAAAATCAAATTTACAGAATTTACAGATTATACACAACCAAATCA 
AGCGACAGCCAATAAGGATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAATAA 
GAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAGAAGGTAAAATCTCTTAAAAA 
ATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCAACAAATGGTAGCCGTGCATTGTATGTACTTCAGTCAGC 
AGGTTTAATCAAATTGAATGTTTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATATTAA 
TATTCAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAATACATACAT 
TGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAATCAGATAAAAATTCAAAACAATGGATTAATAT 
CATTGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGA 
TGAAGTGAAAAAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGG 

SEQ ID NO. 2908: SAG1641 FROM THE H36b GBS TYPE Ib STRAIN

AAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAAGCAC
GTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCTAAAATCAAATTTACAGAATTTACAGATTATACACAACCAA 
ATCAAGCGACAGCCAATAAGGATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAA 
ATAAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAGAAGGTAAAATCTCTTA 
AAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCAACAAATGGTAGCCGTGCATTGTATGTCCTTCAGT 
CAGCAGGTTTAATCAAATTGAATGTTTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATA 
TTAATATTCAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAATACAT 
ACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAATCAGATAAAAATTCAAAACAATGGATTA 
ATATCATTGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACA
CAGATGAAGTGAAAAAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGG 

SEQ ID NO. 2909: SAG1641 FROM THE JM9130013 GBS TYPE VIII STRAIN

TTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAAGCACGTTGGGA 
TAAAATTGAAAAGCTAGTAGGTGATAAAGCTAAAATCAAATTTACAGAATTTACAGATTATACACAACCAAATCAAGC 
GACAGCCAATAAGGATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAATAAGAA 
AAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAGAAGGTAAAATCTCTTAAAAAATT 
GAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCAACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGG
TTTAATCAAATTGAATGTTTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATATTAATAT 
TCAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAATACATACATTGA 
GCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAATCAGATAAAAATTCAAAACAATGGATTAATATCAT
TGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGA 
AGTGAAAAAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGG 

SEQ ID NO. 2910: SAG1641 FROM THE M732 GBS TYPE III STRAIN

AATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAA 
GCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCTAAAATCAAATTTACAGAATTTACAGATTATACACAA 
CCAAATCAAGCGACAGCCAATAAGGATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAG 
GAAAATAAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAGAAGGTAAAATCT 
CTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCAACAAATGGTAGCCGTGCATTGTATGTCCTT 
CAGTCAGCAGGTTTAATCAAATTGAATGTTTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAG 
GATATTAATATTCAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAAT
ACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAATCAGATAAAAATTCAAAACAATGG 
ATTAATATCATTGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTAT 
CACACAGATGAAGTGAAAAAAGTTATCAAAGATAC 

SEQ ID NO. 2911: SAG1641 FROM THE M781 GBS TYPE III STRAIN 

AGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAAGCACGTTG 
GGATAAAATTGAAAAGCTAGTAGGTGATAAAGCTAAAATCAAATTTACAGAATTTACAGATTATACACAACCAAATCA 
AGCGACAGCCAATAAGGATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAATAA 
GAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAGAAGGTAAAATCTCTTAAAAA 
ATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCAACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGC 
AGGTTTAATCAAATTGAATGTTTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATATTAA 
TATTCAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAATACATACAT 
TGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAATCAGATAAAAATTCAAAACAATGGATTAATAT 
CATTGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTGGGATGCTTATCACACAGA 
TGAAGTGAAAAAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGG 



SEQ2901 ATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC
SEQ2902 ATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC
SEQ2903 ATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC
SEQ2904 ATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC
SEQ2905 ATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC
SEQ2906                             AAGTAAAGTTGTTAAAGTTGGTGTTATGACC
SEQ2907        AGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC
SEQ2908    AAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC
SEQ2909           TTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC
SEQ2910 ATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC
SEQ2911       -AGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC

SEQ2901 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGCGATAAAGCT
SEQ2902 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCT
SEQ2903 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCT
SEQ2904 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCT
SEQ2905 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCT
SEQ2906 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGCGATAAAGCT
SEQ2907 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCT
SEQ2908 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCT
SEQ2909 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCT
SEQ2910 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCT
SEQ2911 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCT

SEQ2901 AAAATCAAATTCACAGAATTTACAGATTATACACAACCAAATCAAGCGACAGCCAATAAG
SEQ2902 AAAATCAAATTTACAGAATTTACAGATTATACACAACCAAATCAAGCGACAGCCAATAAG
SEQ2903 AAAATCAAATTTACAGAATTTACAGATTATACACAACCAAATCAAGCGACAGCCAATAAG
SEQ2904 AAAATCAAATTTACAGAATTTACAGATTATACACAACCAAATCAAGCGACAGCCAATAAG
SEQ2905 AAAATCAAATTTACAGAATTTACAGATTATACACAACCAAATCAAGCGACAGCCAATAAG
SEQ2906 AAAATCAAATTCACAGAATTTACAGATTATACACAACCAAATCAAGCGACAGCCAATAAG
SEQ2907 AAAATCAAATTTACAGAATTTACAGATTATACACAACCAAATCAAGCGACAGCCAATAAG
SEQ2908 AAAATCAAATTTACAGAATTTACAGATTATACACAACCAAATCAAGCGACAGCCAATAAG
SEQ2909 AAAATCAAATTTACAGAATTTACAGATTATACACAACCAAATCAAGCGACAGCCAATAAG
SEQ2910 AAAATCAAATTTACAGAATTTACAGATTATACACAACCAAATCAAGCGACAGCCAATAAG
SEQ2911 AAAATCAAATTTACAGAATTTACAGATTATACACAACCAAATCAAGCGACAGCCAATAAG

SEQ2901 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT
SEQ2902 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT
SEQ2903 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT
SEQ2904 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT
SEQ2905 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT
SEQ2906 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT
SEQ2907 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT
SEQ2908 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT
SEQ2909 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT
SEQ2910 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT
SEQ2911 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT

SEQ2901 AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCCCCAATTCGTATCTATTCTGAG
SEQ2902 AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAG
SEQ2903 AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAG
SEQ2904 AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAG
SEQ2905 AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAG
SEQ2906 AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCCCCAATTCGTATCTATTCTGAG
SEQ2907 AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAG
SEQ2908 AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAG



SEQ2909 AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAG
SEQ2910 AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAG
SEQ2911 AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAG

SEQ2901 AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCA
SEQ2902 AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCA
SEQ2903 AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCA
SEQ2904 AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCA
SEQ2905 AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCA
SEQ2906 AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCA
SEQ2907 AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCA
SEQ2908 AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCA
SEQ2909 AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCA
SEQ2910 AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCA
SEQ2911 AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCA

SEQ2901 ACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGTT
SEQ2902 ACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGTT
SEQ2903 ACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGTT
SEQ2904 ACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGTT
SEQ2905 ACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGTT
SEQ2906 ACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGTT
SEQ2907 ACAAATGGTAGCCGTGCATTGTATGTACTTCAGTCAGCAGGTTTAATCAAATTGAATGTT
SEQ2908 ACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGTT
SEQ2909 ACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGTT
SEQ2910 ACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGTT
SEQ2911 ACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGTT

SEQ2901 TCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAAGATATTAATATT
SEQ2902 TCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATATTAATATT
SEQ2903 TCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATATTAATATT
SEQ2904 TCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATATTAATATT
SEQ2905 TCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATATTAATATT
SEQ2906 TCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAAGATATTAATATT
SEQ2907 TCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATATTAATATT
SEQ2908 TCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATATTAATATT
SEQ2909 TCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATATTAATATT
SEQ2910 TCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATATTAATATT
SEQ2911 TCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATATTAATATT

SEQ2901 CAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATT
SEQ2902 CAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATT
SEQ2903 CAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATT
SEQ2904 CAGGAGTTAGATGCCAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATT
SEQ2905 CAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATT
SEQ2906 CAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATT
SEQ2907 CAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATT
SEQ2908 CAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATT
SEQ2909 CAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATT
SEQ2910 CAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATT
SEQ2911 CAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATT

SEQ2901 AATAATACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA
SEQ2902 AATAATACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA
SEQ2903 AATAATACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA
SEQ2904 AATAATACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA
SEQ2905 AATAATACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA
SEQ2906 AATAATACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA
SEQ2907 AATAATACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA
SEQ2908 AATAATACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA
SEQ2909 AATAATACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA
SEQ2910 AATAATACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA
SEQ2911 AATAATACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA



SEQ2901 TCAGATAAAAATTCAAAACAATGGATTAATATCATTGCGGGACGTAAAAATTGGAAAAAG
SEQ2902 TCAGATAAAAATTCAAAACAATGGATTAATATCATTGCGGGACGTAAAAATTGGAAAAAG
SEQ2903 TCAGATAAAAATTCAAAACAATGGATTAATATCATTGCGGGACGTAAAAATTGGAAAAAG
SEQ2904 TCAGATAAAAATTCAAAACAATGGATTAATATCATTGCGGGACGTAAAAATTGGAAAAAG
SEQ2905 TCAGATAAAAATTCAAAACAATGGATTAATATCATTGCGGGACGTAAAAATTGGAAAAAG
SEQ2906 TCAGATAAAAATTCAAAACAATGGATTAATATCATTGCGGGACGTAAAAATTGGAAAAAG
SEQ2907 TCAGATAAAAATTCAAAACAATGGATTAATATCATTGCGGGACGTAAAAATTGGAAAAAG
SEQ2908 TCAGATAAAAATTCAAAACAATGGATTAATATCATTGCGGGACGTAAAAATTGGAAAAAG
SEQ2909 TCAGATAAAAATTCAAAACAATGGATTAATATCATTGCGGGACGTAAAAATTGGAAAAAG
SEQ2910 TCAGATAAAAATTCAAAACAATGGATTAATATCATTGCGGGACGTAAAAATTGGAAAAAG
SEQ2911 TCAGATAAAAATTCAAAACAATGGATTAATATCATTGCGGGACGTAAAAATTGGAAAAAG

SEQ2901 CAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGAAGTGAAA
SEQ2902 CAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGAAGTGAAA
SEQ2903 CAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGAAGTGAAA
SEQ2904 CAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGAAGTGAAA
SEQ2905 CAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGAAGTGAAA
SEQ2906 CAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGAAGTGAAA
SEQ2907 CAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGAAGTGAAA
SEQ2908 CAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGAAGTGAAA
SEQ2909 CAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGAAGTGAAA
SEQ2910 CAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGAAGTGAAA
SEQ2911 CAAAAGAACGCTAAAGCTATCCAAGCTATCTGGGATGCTTATCACACAGATGAAGTGAAA

SEQ2901 AAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGGAACCCAGCTTTCTTGTACAA
SEQ2902 AAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGG
SEQ2903 AAAGTTATCAAAGATACTTCAGCTGATATTCCAC
SEQ2904 AAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGG
SEQ2905 AAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGG
SEQ2906 AAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGGAA
SEQ2907 AAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGG
SEQ2908 AAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGG
SEQ2909 AAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGG
SEQ2910 AAAGTTATCAAAGATAC
SEQ2911 AAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGG



>SEQ ID NO 2950: 35_090 frame: 1 

NQEVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK
DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA
TNGSRALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAII
NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK
KVIKDTSADIPQWNPAFLY

>SEQ ID NO 2951: 35_1169NT frame: 3 

QEVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANKD
VDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDAT
NGSRALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAIIN
NTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVKK
VIKDTSADIPQW

>SEQ ID NO 2952: 35_18RS21 frame: 1 

NQEVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK
DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA
TNGSRALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAII
NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK
KVIKDTSADIP



>SEQ ID NO 2953: 35_2603 frame: 1

NQEVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK
DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA
TNGSRALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAII
NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK
KVIKDTSADIPQW

>SEQ ID NO 2954: 35_A909 frame: 1

NQEVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK
DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA
TNGSRALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAII
NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK
KVIKDTSADIPQW

>SEQ ID NO 2955: 35_CJB110 frame: 2

SKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANKDVDINAFQHY
NFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDATNGSRALYVL
QSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAIINNTYIEQANL
KPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVKKVIKDTSADI
PQW

>SEQ ID NO 2956: 35_COH1 frame: 2

VSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANKDVD
INAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDATNG
SRALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAIINNT
YIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVKKVI
KDTSADIPQW

>SEQ ID NO 2957: 35_H36B frame: 3

EVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANKDV
DINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDATN
GSRALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAIINN
TYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVKKV
IKDTSADIPQW

>SEQ ID NO 2958: 35_JM9130013 frame: 2

SASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANKDVDI
NAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDATNGS
RALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAIINNTY
IEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVKKVIK
DTSADIPQW

>SEQ ID NO 2959: 35_M732 frame: 1

NQEVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK
DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA
TNGSRALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAII
NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK
KVIKD

>SEQ ID NO 2960: 35_M781 frame: 2

VSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANKDVD
INAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDATNG
SRALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAIINNT
YIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAIWDAYHTDEVKKVI
KDTSADIPQW

SEQ2950 QEVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK
SEQ2951 QEVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK
SEQ2952 QEVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK
SEQ2953 QEVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK
SEQ2954 QEVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK
SEQ2955          SKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK
SEQ2956 --VSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK
SEQ2957 -EVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK
SEQ2958    SASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK
SEQ2959 QEVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK
SEQ2960 --VSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK

SEQ2950 DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA
SEQ2951 DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA
SEQ2952 DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA
SEQ2953 DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA
SEQ2954 DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA
SEQ2955 DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA
SEQ2956 DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA
SEQ2957 DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA
SEQ2958 DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA
Table 29: Comparative Sequences relating to SAG1641 (YaeC family protein)



SEQ2959 DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA
SEQ2960 DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA

SEQ2950 TNGSRALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAII
SEQ2951 TNGSRALYVLQSAGLIKLWSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAII
SEQ2952 TNGSRALYVLQSAGLIKLWSGKKVATVANITSNKKBINIQELDASQTPRALKDVDAAII
SEQ2953 TNGSRALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAII
SEQ2954 TNGSRAL^nni.QSAGLIKLNVSGKKVATVANITSNKKXJlNIQELDASQTPRALKDVDAAI^
SEQ2955 TNGSRALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAII
SEQ2956 TNGSRALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAII
SEQ2957 TNGSRALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAII
SEQ2958 TNGSRALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAII
SEQ2959 TNGSRALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAII
SEQ2960 TNGSRALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAII

SEQ2950 NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK
SEQ2951 NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK
SEQ2952 NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK
SEQ2953 NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK
SEQ2954 NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK
SEQ2955 NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK
SEQ2956 NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK
SEQ2957 NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK
SEQ2958 NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK
SEQ2959 NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK
SEQ2960 NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAIWDAYHTDEVK

SEQ2950 KVIKDTSADIPQWNPAFLY
SEQ2951 KVIKDTSADIPQW
SEQ2952 KVIKDTSADIP
SEQ2953 KVIKDTSADIPQW
SEQ2954 KVIKDTSADIPQW
SEQ2955 KVIKDTSADIPQW
SEQ2956 KVIKDTSADIPQW
SEQ2957 KVIKDTSADIPQW
SEQ2958 KVIKDTSADIPQW
SEQ2959 KVIKD
SEQ2960 KVIKDTSADIPQW
Table 30: Comparative Sequences relating to SAG2147
(protein of unknown function / lipoprotein, putative)

SEQ ID NO. 3001: SAG2147 FROM THE 1169NT1 GBS TYPE V STRAIN 
(REVERSE COMPLEMENT) 

AAAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGCC 

AAAAAATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGCT 

CCAAAACCTTCTCAGGCATCTAATGAAGTCCCAAAATCAAGTTCTCAATCTACAGAAGCT 

AATTCTCAGCAACAAGTTACTGCGAGTGAAGAGGCGGCTGTAGAACAAGCAGTTGTAACA 

GAAAATACCCCTGCTACCAGTCAGGCACAACAAACTTATGCTGTTACTGAGACAACTTAC 

AAACCTGCTCAACACCAGACAAGTGGCCAAGTATTGAGCAATGGAAATACTGCAGGGGCG 

GTCGGATCTGCTGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTACTTGG 

GAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAGCT 

TCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGGATCAAGTT 

AATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTAC 

SEQ ID NO. 3002: SAG2147 FROM THE 18RS21 GBS TYPE II STRAIN
(REVERSE COMPLEMENT) 

AAAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAAAGTTC
GCGTAGCCAAAAAATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAA 
AACAGGCTCCAAAACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCTCAATCTA 
CAGAAGCTAATTCTCAGCAACAAGTTACTGCGAGTGAAGAGGCAGCTGTAGAACAAGCAG
TTGTAACAGAAAACACCCCTGCTACCAGTCAGGCACAACAAGCTTATGCTGTTACTGAGA
CAACTTATAGACCTGCTCAACACCAGACGAGTGGCCAAGTATTGAGTAATGGAAATACTG 
CAGGGGCTATTGGCTCAGCAGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCAGT 
CTACTTGGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCT
CAGGAGCTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGG
ATCAAGTTAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTAC 

SEQ ID NO. 3003: SAG2147 FROM THE 2603 V/R GBS TYPE V STRAIN 
(REVERSE COMPLEMENT) 

AAAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAAAGT 

TCGCGTAGCCAAAAAATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGT 

AAAACAGGCTCCAAAACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCTCAATC 

TACAGAAGCTAATTCTCAGCAACAAGTTACTGCGAGTGAAGAGGCAGCTGTAGAACAAGC 

AGTTGTAACAGAAAACACCCCTGCTACCAGTCAGGCACAACAAGCTTATGCTGTTACTGA

GACAACTTATAGACCTGCTCAACACCAGACGAGTGGCCAAGTATTGAGTAATGGAAATAC 

TGCAGGGGCTATTGGCTCAGCAGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCA 

GTCTACTTGGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGC 

CTCAGGAGCTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCA 

GGATCAAGTTAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTA

C 

SEQ ID NO. 3004: SAG2147 FROM THE 090 GBS TYPE Ia STRAIN
(REVERSE COMPLEMENT) 

TAGCCAAAAAATCAAAAATGATTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAAC 
AGGCTCCAAAACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCTCAATCTACAG
AAGCTAATTCTCAGCAACAAGTTACTGCGAGTGAAGAGGCAGCTGTAGAACAAGCAGTTG 
TAACAGAAAACACCCCTGCTACCAGTCAGGCACAACAAGCTTATGCTGTTACTGAGACAA 
CTTATAGACCTGCTCAACACCAGACGAGTGGCCAAGTATTGAGTAATGGAAATACTGCAG
GGGCTATTGGCTCAGCAGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTA 
CTTGGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAG 
GAGCTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGGA

SEQ ID NO. 3005: SAG2147 FROM THE A909 GBS TYPE Ia STRAIN
(REVERSE COMPLEMENT) 

AAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCAAAACCTTCTCAGGCA
TCTAATGAAGCCCCAAAATCAAGTTCTCAATCTACAGAAGCTAATTCTCAGCAACAAGTT 
ACTGCGAGTGAAGAGGCAGCTGTAGAACAAGCAGTTGTAACAGAAAACACCCCTGCTACC 
AGTCAGGCACAACAAGCTTATGCTGTTACTGAGACAACTTATAGACCTGCTCAACACCAG 
ACAAGTGGCCAAGTATTGAGTAATGGAAATACTGCAGGGGCTATTGGCTCAGCAGCTGCA 
GCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTACTTGGGAACATATTATTGCCCGT 
GAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAGCTTCAGGACTTTTCCAAACG 
ATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGAATCAAGTTAATTCAGCTATTAAAGCT 
TATCGTGCTCAAGGTTTATCA 

SEQ ID NO. 3006: SAG2147 FROM THE CJB110 GBS NONTYPEABLE STRAIN 
(REVERSE COMPLEMENT) 

AATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGCCAAAAAATCAAAAATGACTAAGGCGA 
CATCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCAAAACCTTCTCAGGCATCTAATG 
AAGCCCCAAAATCAAGTTCTCAATCTACAGAAGCTAATTCTCAGCAACAAGTTACTGCGA
GTGAAGAGGCAGCTGTAGAACAAGCAGTTGTAACAGAAAACACCCCTGCTACCAGTCAGG
CACAACAAGCTTATGCTGTTACTGAGACAACTTATAGACCTGCTCAACACCAGACGAGTG
GCCAAGTATTGAGTAATGGAAATACTGCAGGGGCTATTGGCTCAGCAGCTGCAGCACAAA
TGGCTGCTGCAACAGGAGTCCCTCAGTCTACTTGGGAACATATTATTGCCCGTGAATCAA 
ATGGTAATCCTAATGTTGCTAATGCCTCAGGAGCTTCAGGACTTTTCCAAACGATGCCAG 
GTTGGGGTTCAACAGCTACAGTTCAGGATCAAGTTAATTCAGCTATTAAAGCTTATCGTG 
CTCAAGGTTTATCAGCTTGGGGTTAC 

SEQ ID NO. 3007: SAG2147 FROM THE COH1 GBS TYPE III STRAIN
(REVERSE COMPLEMENT) 

AAAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAA 

AGTTCGCGTAGCCAAAAAATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGA 
TGTAAAACAGGCTCCAAAACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCTCA 
ATCTACAGAAGCTAATTCTCAGCAACAAGTTACTGCGAGTGAAGAGGCGGCTGTAGAACA
AGCAGTTGTAACAGAAAATACCCCTGCTACCAGTCAGGCACAACAAACTTATGCTGTTAC
TGAGACAACTTACAAACCTGCTCAACACCAGACAAGTGGCCAAGTATTGAGCAATGGAAA 
TACTGCAGGGGCGGTCGGATCTGCTGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCC
TCAGTCTACTTGGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAA 
TGCCTCAGGAGCTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGT 
TCAGGATCAAGTTAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGG 
TTAC 

SEQ ID NO. 3008: SAG2147 FROM THE H36b GBS TYPE Ib STRAIN
(REVERSE COMPLEMENT) 

AAAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGC

AGATAAAGTTCGCGTAGCCAAAAAATCAAAAATGACTAAGGCGACATCTAAATCAAAAGT
AGAAGATGTAAAACAGGCTCCAAAACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAG
TTCTCAATCTACAGAAGCTAATTCTCAGCAACAAGTTACTGCGAGTGAAGAGGCAGCTGT
AGAACAAGCAGTTGTAACAGAAAACACCCCTGCTACCAGTCAGGCACAACAAGCTTATGC
TGTTACTGAGACAACTTATAGACCTGCTCAACACCAGACAAGTGGCCAAGTATTGAGTAA 
TGGAAATACTGCAGGGGCTATTGGCTCAGCAGCTGCAGCACAAATGGCTGCTGCAACAGG 
AGTCCCTCAGTCTACTTGGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGT 
TGCTAATGCCTCAGGAGCTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGC 
TACAGTTCAGGATCAAGTTAATTCAGCTATTAAAGCTT 

SEQ ID NO. 3009: SAG2147 FROM THE M732 GBS TYPE III STRAIN 
(REVERSE COMPLEMENT) 

AAAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGC 

CAAAAAATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGC

TCCAAAACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCTCAATCTACAGAAGC 

TAATTCTCAGCAACAAGTTACTGCGAGTGAAGAGGCGGCTGTAGAACAAGCAGTTGTAAC 

AGAAAATACCCCTGCTACCAGTCAGGCACAACAAACTTATGCTGTTACTGAGACAACTTA 

CAAACCTGCTCAACACCAGACAAGTGGCCAAGTATTGAGCAATGGAAATACTGCAGGGGC 

GGTCGGATCTGCTGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTACTTG 

GGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAGC 

TTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGGATCAAGT 

TAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTA 

SEQ ID NO. 3010: SAG2147 FROM THE M781 GBS TYPE III STRAIN 
(REVERSE COMPLEMENT) 

GTAACCCCAAGCTGATAAACCTTGAGCACGATAAGCTTTAATAGCTGAATTAACTTGATC
CTGAACTGTAGCTGTTGAACCCCAACCTGGCATCGTTTGGAAAAGTCCTGAAGCTCCTGA 
GGCATTAGCAACATTAGGATTACCATTTGATTCACGGGCAATAATATGTTCCCAAGTAGA 
CTGAGGGACTCCTGTTGCAGCAGCCATTTGTGCTGCAGCAGCAGATCCGACCGCCCCTGC 
AGTATTTCCATTGCTCAATACTTGGCCACTTGTCTGGTGTTGAGCAGGTTTGTAAGTTGT 
CTCAGTAACAGCATAAGTTTGTTGTGCCTGACTGGTAGCAGGGGTATTTTCTGTTACAAC 
TGCTTGTTCTACAGCCGCCTCTTCACTCGCAGTAACTTGTTGCTGAGAATTAGCTTCTGT 
AGATTGAGAACTTGATTTTGGGGCTTCATTAGATGCCTGAGAAGGTTTTGGAGCCTGTTT 
TACATCTTCTACTTTTGATTTAGATGTCGCCTTAGTCATTTTTGATTTTTTGGCTACGCG 
AACTTTATCTGCTTTTGACAAAGA 



SEQ3001
SEQ3002
SEQ3003
SEQ3004
SEQ3005 AGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCAAAACCTTCTCAGGCA
SEQ3007
SEQ3008
SEQ3009
SEQ3010

SEQ3001
SEQ3002
SEQ3003
SEQ3004
SEQ3005 CTAATGAAGCCCCAAAATCAAGTTCTCAATCTACAGAAGCTAATTCTCAGCAACAAGTT
SEQ3007
SEQ3008
SEQ3009
SEQ3010

SEQ3001
SEQ3002
SEQ3003
SEQ3004
SEQ3005 CTGCGAGTGAAGAGGCAGCTGTAGAACAAGCAGTTGTAACAGAAAACACCCCTGCTACC
SEQ3007
SEQ3008
SEQ3009
SEQ3010

SEQ3001
SEQ3002
SEQ3003
SEQ3004
SEQ3005 GTCAGGCACAACAAGCTTATGCTGTTACTGAGACAACTTATAGACCTGCTCAACACCAG
SEQ3007
SEQ3008
SEQ3009
SEQ3010

SEQ3001
SEQ3002
SEQ3003
SEQ3004
SEQ3005 CAAGTGGCCAAGTATTGAGTAATGGAAATACTGCAGGGGCTATTGGCTCAGCAGCTGCA
SEQ3007
SEQ3008
SEQ3009
SEQ3010

SEQ3001
SEQ3002
SEQ3003
SEQ3004
SEQ3005 CACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTACTTGGGAACATATTATTGCCCGT
SEQ3007
SEQ3008
SEQ3009
SEQ3010

SEQ3001
SEQ3002
SEQ3003
SEQ3004
SEQ3005 AATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAGCTTCAGGACTTTTCCAAACG
SEQ3007
SEQ3008
SEQ3009
SEQ3010

SEQ3001
SEQ3002
SEQ3003
SEQ3004
SEQ3005 TGCCAGGTTGGGGTTCAACAGCTACAGTTCAGAATCAAGTTAATTCAGCTATTAAAGCT
SEQ3007
SEQ3008
SEQ3009
SEQ3010

SEQ3001 AAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGCCAAA
SEQ3002 AAAGTTGACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGCCAAA
SEQ3003 AAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGCCAAA
SEQ3004 TAGCCAAA
SEQ3005 ATCGTGCTCAAGGTTTATCASAATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGCCAAA
SEQ3007 AAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGCCAAA
SEQ3008 AAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGCCAAA
SEQ3009 AAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGCCAAA
SEQ3010 GTAACCCCAAGCTGATAAACCTTGAGCACGATAAGCTTTAATAGCTGAA

SEQ3001 AATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCA
SEQ3002 AATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCA
SEQ3003 AATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCA
SEQ3004 AATCAAAAATGATTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCA
SEQ3005 AATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCA
SEQ3007 AATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCA
SEQ3008 AATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCA
SEQ3009 AATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCA
SEQ3010 TAACTTGATCCTGAACTGTAGCTGTTGAACCCCAACCTGGCATCGTTTGGAAAAGTCCT

SEQ3001 AACCTTCTCAGGCATCTAATGAAGTCCCAAAATCAAGTTCTCAATCTACAGAAGCTAAT
SEQ3002 AACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCTCAATCTACAGAAGCTAAT
SEQ3003 AACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCTCAATCTACAGAAGCTAAT
SEQ3004 AACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCTCAATCTACAGAAGCTAAT
SEQ3005 AACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCTCAATCTACAGAAGCTAAT
SEQ3007 AACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCTCAATCTACAGAAGCTAAT
SEQ3008 AACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCTCAATCTACAGAAGCTAAT
SEQ3009 AACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCTCAATCTACAGAAGCTAAT
SEQ3010 AAGCTCCTGAGGCATT---AGCAACATTAGGATTAC-CATTTGATTCACGGGCAATAAT

SEQ3001 TCTCAGCAACAAGTTACTGCGAGTGAAGAGGCGGCTGTAGAACAAGCAGTTGTAACAGA
SEQ3002 TCTCAGCAACAAGTTACTGCGAGTGAAGAGGCAGCTGTAGAACAAGCAGTTGTAACAGA
SEQ3003 TCTCAGCAACAAGTTACTGCGAGTGAAGAGGCAGCTGTAGAACAAGCAGTTGTAACAGA
SEQ3004 TCTCAGCAACAAGTTACTGCGAGTGAAGAGGCAGCTGTAGAACAAGCAGTTGTAACAGA
SEQ3005 TCTCAGCAACAAGTTACTGCGAGTGAAGAGGCAGCTGTAGAACAAGCAGTTGTAACAGA
SEQ3007 TCTCAGCAACAAGTTACTGCGAGTGAAGAGGCGGCTGTAGAACAAGCAGTTGTAACAGA
SEQ3008 TCTCAGCAACAAGTTACTGCGAGTGAAGAGGCAGCTGTAGAACAAGCAGTTGTAACAGA
SEQ3009 TCTCAGCAACAAGTTACTGCGAGTGAAGAGGCGGCTGTAGAACAAGCAGTTGTAACAGA
SEQ3010 TGTTCCCAAGTAGACTGAGGGACTCCTGTTGCAGCAGCCATTTGTGCTGCAGCAGCAGA

SEQ3001 --AAATACCCCTGCTACCAGTCAGGCACAACAAACTTATGCTGTTACTGAGACAACTTA
SEQ3002 --AAACACCCCTGCTACCAGTCAGGCACAACAAGCTTATGCTGTTACTGAGACAACTTA
SEQ3003 --AAACACCCCTGCTACCAGTCAGGCACAACAAGCTTATGCTGTTACTGAGACAACTTA
SEQ3004 --AAACACCCCTGCTACCAGTCAGGCACAACAAGCTTATGCTGTTACTGAGACAACTTA
SEQ3005 --AAACACCCCTGCTACCAGTCAGGCACAACAAGCTTATGCTGTTACTGAGACAACTTA
SEQ3007 --AAATACCCCTGCTACCAGTCAGGCACAACAAACTTATGCTGTTACTGAGACAACTTA
SEQ3008 --AAACACCCCTGCTACCAGTCAGGCACAACAAGCTTATGCTGTTACTGAGACAACTTA
SEQ3009 --AAATACCCCTGCTACCAGTCAGGCACAACAAACTTATGCTGTTACTGAGACAACTTA
SEQ3010 CCGACCGCCCCTGCAGTATTTCCATTGCTCAATACTTG-GCCACTTGTCTGGTGTTGAG

SEQ3001 AAACCTGCTCAACACCAGACAAGTGGC-CAAGTATTGAGCAATGGAAATACTGCAGGGG
SEQ3002 AGACCTGCTCAACACCAGACGAGTGGC-CAAGTATTGAGTAATGGAAATACTGCAGGGG
SEQ3003 AGACCTGCTCAACACCAGACGAGTGGC-CAAGTATTGAGTAATGGAAATACTGCAGGGG
SEQ3004 AGACCTGCTCAACACCAGACGAGTGGC-CAAGTATTGAGTAATGGAAATACTGCAGGGG
SEQ3005 AGACCTGCTCAACACCAGACGAGTGGC-CAAGTATTGAGTAATGGAAATACTGCAGGGG
SEQ3007 AAACCTGCTCAACACCAGACAAGTGGC-CAAGTATTGAGCAATGGAAATACTGCAGGGG
SEQ3008 AGACCTGCTCAACACCAGACAAGTGGC-CAAGTATTGAGTAATGGAAATACTGCAGGGG
SEQ3009 AAACCTGCTCAACACCAGACAAGTGGC-CAAGTATTGAGCAATGGAAATACTGCAGGGG
SEQ3010 AGGTTTGTAAGTTGTCTCAGTAACAGCATAAGTTTGTTGTGCCTGACTGGTAGCAGGGG

SEQ3001 GGTCGGATCTGCTGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTACTT
SEQ3002 TATTGGCTCAGCAGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTACTT
SEQ3003 TATTGGCTCAGCAGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTACTT
SEQ3004 TATTGGCTCAGCAGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTACTT
SEQ3005 TATTGGCTCAGCAGCTGCAGCACAAATGGCTGCTGCAACAGGAGTC
SEQ3007 GGTCGGATCTGCTGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTACTT
SEQ3008 TATTGGCTCAGCAGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTACTT
SEQ3009 GGTCGGATCTGCTGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTACTT
SEQ3010 A-TTT--TCTGTTACAACTGCTTGTTCTACAGCCGCCTCTTCACTCGCAGTAACTTGTT

SEQ3001 GGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAG
SEQ3002 GGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAG
SEQ3003 GGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAG
SEQ3004 GGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAG
SEQ3005 GGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAG
SEQ3007 GGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAG
SEQ3008 GGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAG
SEQ3009 GGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAG
SEQ3010 GCTGAGA-ATTAGCTTCTGTAGATTGAG AA--CTTGATTTTGGGGCTTCATTAGATG

SEQ3001 CTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGGATCAAG
SEQ3002 CTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGGATCAAG
SEQ3003 CTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGGATCAAG
SEQ3004 CTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGGA
SEQ3005 CTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCA
SEQ3007 CTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAAC
SEQ3008 CTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGGATCAAG
SEQ3009 CTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGGATCAAG
SEQ3010 CCTGAGAAGGTTTT GGAGCCTGTTTTACATCTTCTACTTTTGATTTAGATGTCGC

SEQ3001 TAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTAC--
SEQ3002 TAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTAC--
SEQ3003 TAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTAC--
SEQ3004
SEQ3005 TAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTAC--
SEQ3007 TAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTAC--
SEQ3008 TAATTCAGCTATTAAAGCTT
SEQ3009 TAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTA---
SEQ3010 TTAGTCA-TTTTTGATTTTTTGGCTACGCGAACTTTATCTGCTTTTGACAAAGA

>SEQ ID NO 3050: 25_JL169NT frame: 1 

KSSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEVPKSSSQSTEAN
SQQQVTASEEAAVEQAWTENTPATSQAQQTYAVTETTYKPAQHQTSGQVLSNGNTAGAV
GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN
SAIKAYRAQGLSAWGY

>SEQ ID NO 3051:25_18RS21 frame: 1 

KSSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN
SQQQVTASEEAAVEQAVVTENTPATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAI
GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLPQTMPGWGSTATVQDQVN 
SAIKAYRAQGLSAWGY 

>SEQ ID NO 3052:25_2603 frame: 1

KSSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN 
SQQQVTASEEAAVEQAWTENTPATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAI 
GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN
SAIKAYRAQGLSAWGY 

>SEQ ID NO 3053:25_090 frame: 3 

AKKSKMIKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEANSQQQVTASEEAAVEQAW
TENTPATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAIGSAAAAQMAAATGVPQST
WEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQ

>SEQ ID NO 3054:25_A909 frame: 1 

KATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEANSQQQVTASEEAAVEQAWTENTPAT 
SQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAIGSAAAAQMAAATGVPQSTWEHIIAR
ESNGNPNVANASGASGLFQTMPGWGSTATVQNQVNSAIKAYRAQGLS

>SEQ ID NO 3055:25_CJB110 frame: 3

SLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEANSQQQVTAS
EEAAVEQAWTENTPATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAIGSAAAAQM
AAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVNSAIKAYRA
QGLSAWGY 

>SEQ ID NO 3056:25_COH1 frame: 1

KSSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN
SQQQVTASEEAAVEQAVVTENTPATSQAQQTYAVTETTYKPAQHQTSGQVLSNGNTAGAV
GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN
SAIKAYRAQGLSAWGY 

>SEQ ID NO 3057:25_H36B frame: 1 

KSSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN
SQQQVTASEEAAVEQAWTENTPATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAI
GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN
SAIKA 

>SEQ ID NO 3058:25_M732 frame: 1 

KSSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN
SQQQVTASEEAAVEQAWTENTPATSQAQQTYAVTETTYKPAQHQTSGQVLSNGNTAGAV
GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN
SAIKAYRAQGLSAWG

>SEQ ID NO 3059:25_M781 frame: 4 

SLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEANSQQQVTAS 
EEAAVEQAWTENTPATSQAQQTYAVTETTYKPAQHQTSGQVLSNGNTAGAVGSAAAAQM
AAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVNSAIKAYRA
QGLSAWGY 



SEQ3050 SSQVTTESLSKADKWVAKKSKMTKATSKSKVEDVKQAPKPSQASNEVPKSSSQSTEAN
SEQ3051 SSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN
SEQ3052 SSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN
SEQ3053 AKKSKMIKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN
SEQ3054 KATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN
SEQ3055 SLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN
SEQ3056 SSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN
SEQ3057 SSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN
SEQ3058 SSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN
SEQ3059 SLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN



SEQ3050 SQQQVTASEEAAVEQAWTENTPATSQAQQTYAVTETTYKPAQHQTSGQVLSNGNTAGAV
SEQ3051 SQQQVTASEEAAVEQAVVTEINTPATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAI
SEQ3052 SQQQVTASEEAAVEQAWTENTPATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAI
SEQ3053 SQQQVTASEEAAVEQAWTENTPATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAI
SEQ3054 SQQQVTASEEAAVEQAWTENTPATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAI
SEQ3055 SQQQVTASEEAAVEQAWTENTPATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAI
SEQ3056 SQQQVTASEEAAVEQAVVTENTPATSQAQQTYAVTETTYKPAQHQTSGQVLSNGNTAGAV
SEQ3057 SQQQVTASEEAAVEQAVVTENTPATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAI
SEQ3058 SQQQVTASEEAAVEQAWTENTPATSQAQQTYAVTETTYKPAQHQTSGQVLSNGNTAGAV
SEQ3059 SQQQVTASEEAAVEQAWTENTPATSQAQQTYAVTETTYKPAQHQTSGQVLSNGNTAGAV

SEQ3050 GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN
SEQ3051 GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN
SEQ3052 GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN
SEQ3053 GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQ
SEQ3054 GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQNQVN
SEQ3055 GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN
SEQ3056 GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN
SEQ3057 GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN
SEQ3058 GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN
SEQ3059 GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN

SEQ3050 AIKAYRAQGLSAWGY
SEQ3051 AIKAYRAQGLSAWGY
SEQ3052 AIKAYRAQGLSAWGY
SEQ3053
SEQ3054 AIKAYRAQGLS
SEQ3055 AIKAYRAQGLSAWGY
SEQ3056 AIKAYRAQGLSAWGY
SEQ3057 AIKA
SEQ3058 AIKAYRAQGLSAWG-
SEQ3059 AIKAYRAQGLSAWGY
Table 31: Comparative Sequences relating to SAG2148
(LysM domain protein)

SEQ ID NO. 3101: SAG2148 FROM THE 1169NT1 GBS TYPE V STRAIN 

GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAAGAGTTAGTG 
TCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGTGATGTTTTAAAATTGGATAATTCTACAGCTAGTCAA 
GCAGAAGCAAAATCTCAACCAACAATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCTGCA
AAAGAAGAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGAAGATATCAACTG
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGCTTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 

SEQ ID NO. 3102: SAG2148 FROM THE 18RS21 GBS TYPE II STRAIN 

GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAAGAGTTAGTG
TCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGTGATGTTTTAAAATTGGATAATTCTACAGCTAGTCAA
GCAGAAGCAAAATCTCAACCAACAATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCA
AAAGAAGAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGAAGATATCAACTG 
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGTTTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 

SEQ ID NO. 3103: SAG2148 FROM THE 2603 V/R GBS TYPE V STRAIN 

GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAAGAGTTAGTG 
TCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGTGATGTTTTAAAATTGGATAATTCTACAGCTAGTCAA 
GCAGAAGCAAAATCTCAACCAACAATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCA
AAAGAAGAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGAAGATATCAACTG 
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGTTTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 

SEQ ID NO. 3104: SAG2148 FROM THE 090 GBS TYPE Ia STRAIN

GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAAGAGTTAGTG 
TCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGTGATGTTTTAAAATTGGATAATTCTAAAGCTAGTCAA
GCAGAAGCAAAATCTCAACCAACAATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCA 
AAAGAAGAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGAAGATATCAACTG 
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGTTTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 

SEQ ID NO. 3105: SAG2148 FROM THE A909 GBS TYPE Ia STRAIN

GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAAGAGTTAGTG 
TCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGTGATGTTTTAAAATTGGATAATTCTACAGCTAGTCAA
GCAGAAGCAAAATCTCAACCAACAATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCA
AAAGAAGAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGAAGATATCAACTG 
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGCTTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 

SEQ ID NO. 3106: SAG2148 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAAGAGTTAGTG
TCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGTGATGTTTTAAAATTGGATAATTCTAAAGCTAGTCAA
GCAGAAGCAAAATCTCAACCAACAATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCA
AAAGAAGAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGAAGATATCAACTG
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGTTTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 

SEQ ID NO. 3107: SAG2148 FROM THE COH1 GBS TYPE III STRAIN 

GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAATAGTTAGTG
TCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGTGATGTTTTAAAATTGGATAATTCTACAGCTAGTCAA
GCAGAAGCAAAATCTCAACCAACAATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCTGCA
AAAGAAGAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGAAGATATCAACTG 
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGCTTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 

SEQ ID NO. 3108: SAG2148 FROM THE H36b GBS TYPE Ib STRAIN
(REVERSE COMPLEMENT) 

GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAAGAGTTAGTG 
TCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGTGATGTTTTAAAATTGGATAATTCTACAGCTAGTCAA 
GCAGAAGCAAAATCTCAACCAACAATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCA 
AAAGAAGAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGAAGATATCAACTG 
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGCTTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 

SEQ ID NO. 3109: SAG2148 FROM THE JM9130013 GBS TYPE VIII STRAIN 
(REVERSE COMPLEMENT) 

GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAAGAGTTAGTG 
TCTCTCAATAGTATCAGTAACGCTGACGTCATCAGTATAGGTGATGTTTTAAAATTGGATAATTCTACAACTAGTCAA 
GCAGAAGCAAAATCTCAACCAACAATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCA 
AAAGAAGAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGAAGATATCAACTG
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGCTTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 

SEQ ID NO. 3110: SAG2148 FROM THE M732 GBS TYPE III STRAIN 

GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAATAGTTAGTG 
TCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGTGATGTTTTAAAATTGGATAATTCTACAGCTAGTCAA
GCAGAAGCAAAATCTCAACCAACAATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCTGCA
AAAGAAGAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGAAGATATCAACTG
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGCTTCTCGTTAC
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 

SEQ ID NO. 3111: SAG2148 FROM THE M781 GBS TYPE III STRAIN
(REVERSE COMPLEMENT) 

GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAATAGTTAGTG 
TCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGTGATGTTTTAAAATTGGATAATTCTACAGCTAGTCAA 
GCAGAAGCAAAATCTCAACCAACAATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCTGCA
AAAGAAGAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGAAGATATCAACTG
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGCTTCTCGTTAC
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 

SEQ3101 GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT
SEQ3102 GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT
SEQ3103 GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT
SEQ3104 GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT
SEQ3105 GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT
SEQ3106 GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT
SEQ3107 GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT
SEQ3108 GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT
SEQ3109 GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT
SEQ3110 GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT
SEQ3111 GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT

SEQ3101 ACGGTACAAGAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT
SEQ3102 ACGGTACAAGAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT
SEQ3103 ACGGTACAAGAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT
SEQ3104 ACGGTACAAGAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT
SEQ3105 ACGGTACAAGAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT
SEQ3106 ACGGTACAAGAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT
SEQ3107 ACGGTACAATAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT
SEQ3108 ACGGTACAAGAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT
SEQ3109 ACGGTACAAGAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGACGTCATCAGTATAGGT
SEQ3110 ACGGTACAATAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT
SEQ3111 ACGGTACAATAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT



SEQ3101 GATGTTTTAAAATTGGATAATTCTACAGCTAGTCAAGCAGAAGCAAAATCTCAACCAACA
SEQ3102 GATGTTTTAAAATTGGATAATTCTACAGCTAGTCAAGCAGAAGCAAAATCTCAACCAACA
SEQ3103 GATGTTTTAAAATTGGATAATTCTACAGCTAGTCAAGCAGAAGCAAAATCTCAACCAACA
SEQ3104 GATGTTTTAAAATTGGATAATTCTAAAGCTAGTCAAGCAGAAGCAAAATCTCAACCAACA
SEQ3105 GATGTTTTAAAATTGGATAATTCTACAGCTAGTCAAGCAGAAGCAAAATCTCAACCAACA
SEQ3106 GATGTTTTAAAATTGGATAATTCTAAAGCTAGTCAAGCAGAAGCAAAATCTCAACCAACA
SEQ3107 GATGTTTTAAAATTGGATAATTCTACAGCTAGTCAAGCAGAAGCAAAATCTCAACCAACA
SEQ3108 GATGTTTTAAAATTGGATAATTCTACAGCTAGTCAAGCAGAAGCAAAATCTCAACCAACA
SEQ3109 GATGTTTTAAAATTGGATAATTCTACAACTAGTCAAGCAGAAGCAAAATCTCAACCAACA
SEQ3110 GATGTTTTAAAATTGGATAATTCTACAGCTAGTCAAGCAGAAGCAAAATCTCAACCAACA
SEQ3111 GATGTTTTAAAATTGGATAATTCTACAGCTAGTCAAGCAGAAGCAAAATCTCAACCAACA

SEQ3101 ATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCTGCAAAAGAA
SEQ3102 ATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCAAAAGAA
SEQ3103 ATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCAAAAGAA
SEQ3104 ATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCAAAAGAA
SEQ3105 ATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCAAAAGAA
SEQ3106 ATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCAAAAGAA
SEQ3107 ATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCTGCAAAAGAA
SEQ3108 ATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCAAAAGAA
SEQ3109 ATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCAAAAGAA
SEQ3110 ATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCTGCAAAAGAA
SEQ3111 ATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCTGCAAAAGAA



SEQ3101 
SEQ3102 
SEQ3103 
SEQ3104 
SEQ3105 
SEQ3106 
SEQ3107 
SEQ3108 
SEQ3109 
SEQ3110 
SEQ3111 

SEQ3101 
SEQ3102 
SEQ3103 
SEQ3104 
SEQ3105 
SEQ3106 
SEQ3107 
SEQ3108 
SEQ3109 
SEQ3110 
SEQ3111 

SEQ3101 
SEQ3102 
SEQ3103 
SEQ3104 
SEQ3105 
SEQ3106 
SEQ3107 
SEQ3108
SEQ3109 
SEQ3110 
SEQ3111 



GAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGA 
GAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGA
GAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGA 
GAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGA 
GAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGA 
GAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGA 
GAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGA 
GAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGA
GAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGA 
GAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGA 
GAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGA 

AGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA 
AGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA 
AGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA 
AGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA 
AGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA 
AGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA
AGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA 
AGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA 
AGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA
AGATATCAACTGTCTC^TCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA 
AGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA

GTAGCGGACAATTATGTGGCTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGTTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGTTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGTTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGCTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGTTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGCTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGCTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGCTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGCTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGCTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG



SEQ3101 
SEQ3102 
SEQ3103 
SEQ3104 
SEQ3105 
SEQ3106 
SEQ3107 
SEQ3108 
SEQ3109 
SEQ3110 
SEQ3111 



AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 



>SEQ ID NO 3150:15_1169NT frame: 1

ASYTVKSGDTLSAIAKNHKTTVQELVSLNSISNADVISIGDVLKLDNSTASQAEAKSQPT
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
VADNYVASRYGSWSAALSFWNSNGWY

>SEQ ID NO 3151:15_18RS21 frame: 1

ASYTVKSGDTLSAIAKNHKTTVQELVSLNSISNADVISIGDVLKLDNSTASQAEAKSQPT
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
VADNYVVSRYGSWSAALSFWNSNGWY

>SEQ ID NO 3152:15_2603 frame: 1

ASYTVKSGDTLSAIAKNHKTTVQELVSLNSISNADVISIGDVLKLDNSTASQAEAKSQPT
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
VADNYVVSRYGSWSAALSFWNSNGWY

>SEQ ID NO 3153:15_090 frame: 1

ASYTVKSGDTLSAIAKNHKTTVQELVSLNSISNADVISIGDVLKLDNSKASQAEAKSQPT
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
VADNYVVSRYGSWSAALSFWNSNGWY

>SEQ ID NO 3154:15_A909 frame: 1

ASYTVKSGDTLSAIAKNHKTTVQELVSLNSISNADVISIGDVLKLDNSTASQAEAKSQPT
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
VADNYVASRYGSWSAALSFWNSNGWY

>SEQ ID NO 3155:15_CJB110 frame: 1

ASYTVKSGDTLSAIAKNHKTTVQELVSLNSISNADVISIGDVLKLDNSKASQAEAKSQPT
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
VADNYVVSRYGSWSAALSFWNSNGWY

>SEQ ID NO 3156:15_COH1 frame: 1

ASYTVKSGDTLSAIAKNHKTTVQ.LVSLNSISNADVISIGDVLKLDNSTASQAEAKSQPT
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
VADNYVASRYGSWSAALSFWNSNGWY

>SEQ ID NO 3157:15_H36B frame: 1

ASYTVKSGDTLSAIAKNHKTTVQELVSLNSISNADVISIGDVLKLDNSTASQAEAKSQPT
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
VADNYVASRYGSWSAALSFWNSNGWY

>SEQ ID NO 3158:15_JM9130013 frame: 1

ASYTVKSGDTLSAIAKNHKTTVQELVSLNSISNADVISIGDVLKLDNSTTSQAEAKSQPT
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
VADNYVASRYGSWSAALSFWNSNGWY

>SEQ ID NO 3159:15_M732 frame: 1

ASYTVKSGDTLSAIAKNHKTTVQ.LVSLNSISNADVISIGDVLKLDNSTASQAEAKSQPT
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
VADNYVASRYGSWSAALSFWNSNGWY

>SEQ ID NO 3160:15_M781 frame: 1

ASYTVKSGDTLSAIAKNHKTTVQ.LVSLNSISNADVISIGDVLKLDNSTASQAEAKSQPT
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
VADNYVASRYGSWSAALSFWNSNGWY



SEQ3150 
SEQ3151
SEQ3152 
SEQ3153 
SEQ3154 
SEQ3155 
SEQ3156 
SEQ3157 
SEQ3158 
SEQ3159 
SEQ3160 

SEQ3150 
SEQ3151 
SEQ3152 
SEQ3153 
SEQ3154 
SEQ3155 
SEQ3156 
SEQ3157 
SEQ3158 
SEQ3159
SEQ3160

SEQ3150 
SEQ3151 
SEQ3152 
SEQ3153 
SEQ3154 
SEQ3155 
SEQ3156 
SEQ3157 
SEQ3158 
SEQ3159 
SEQ3160 



ASYTVKSGDTLSAIAKNHKTTVQELVSLNSISNADVISIGDVLKLDNSTASQAEAKSQPT
ASYTVKSGDTLSAIAKNHKTTVQELVSLNSISNADVISIGDVLKLDNSTASQAEAKSQPT
ASYTVKSGDTLSAIAKNHKTTVQELVSLNSISNADVISIGDVLKLDNSTASQAEAKSQPT
ASYTVKSGDTLSAIAKNHKTTVQELVSLNSISNADVISIGDVLKLDNSKASQAEAKSQPT
ASYTVKSGDTLSAIAKNHKTTVQELVSLNSISNADVISIGDVLKLDNSTASQAEAKSQPT
ASYTVKSGDTLSAIAKNHKTTVQELVSLNSISNADVISIGDVLKLDNSKASQAEAKSQPT
ASYTVKSGDTLSAIAKNHKTTVQ-LVSLNSISNADVISIGDVLKLDNSTASQAEAKSQPT
ASYTVKSGDTLSAIAKNHKTTVQELVSLNSISNADVISIGDVLKLDNSTASQAEAKSQPT
ASYTVKSGDTLSAIAKNHKTTVQELVSLNSISNADVISIGDVLKLDNSTTSQAEAKSQPT
ASYTVKSGDTLSAIAKNHKTTVQ-LVSLNSISNADVISIGDVLKLDNSTASQAEAKSQPT
ASYTVKSGDTLSAIAKNHKTTVQ-LVSLNSISNADVISIGDVLKLDNSTASQAEAKSQPT

IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK

VADNYVASRYGSWSAALSFWNSNGWY
VADNYVVSRYGSWSAALSFWNSNGWY
VADNYVVSRYGSWSAALSFWNSNGWY
VADNYVVSRYGSWSAALSFWNSNGWY
VADNYVASRYGSWSAALSFWNSNGWY
VADNYVVSRYGSWSAALSFWNSNGWY
VADNYVASRYGSWSAALSFWNSNGWY
VADNYVASRYGSWSAALSFWNSNGWY
VADNYVASRYGSWSAALSFWNSNGWY
VADNYVASRYGSWSAALSFWNSNGWY
VADNYVASRYGSWSAALSFWNSNGWY
Table 32: Conversion of ORF Ref Nos. with SAG Ref Nos.



ORF Ref No. 


SAGxxxx Ref No. 


aa 


Annotation 


ORF00003 


SAG0017


447 


PcsB protein 


ORF00004 


SAG0018


322 


ribose-phosphate pyrophosphokinase 


ORF00005 


SAG0019


391 


aminotransferase, class I 


ORF00006 


SAG0020 


253 


recombination protein O 


ORF00008 


SAG0021 


283 


protease, putative 


ORF00009 


SAG0022 


330 


fatty acid/phospholipid synthesis protein PlsX


ORF00010 


SAG0023 


79 


acyl carrier protein


ORF00011 


SAG0024 


234 


phosphoribosylaminoimidazole-succinocarboxamide 
synthase 


ORF00012 


SAG0025 


1241 


phosphoribosylformylglycinamidine synthase, putative 


ORF00013 


SAG0026 


484 


amidophosphoribosyltransferase 


ORF00014 


SAG0027 


340 


phosphoribosylformylglycinamidine cyclo-ligase 


ORF00015 


SAG0028 


182 


phosphoribosylglycinamide formyltransferase


ORF00016 


SAG0029 


250 


acetyltransferase, GNAT family 


ORF00017 


SAG0030 


515 


phosphoribosylaminoimidazolecarboxamide 
formyltransferase/IMP cyclohydrolase 


ORF00018 


SAG0031 


283 


peptidase, M23/M37 family 


ORF00020 


SAG0032 


434 


group B streptococcal surface immunogenic protein 


ORF00021 


SAG0033




N-acetylmannosamine-6-P epimerase, putative 


ORF00022 


SAG0034 


438 


sugar ABC transporter, sugar-binding protein 


ORF00023 


SAG0035 


295 


sugar ABC transporter, permease protein 


ORF00024 


SAG0036 


276 


sugar ABC transporter, permease protein 


ORF00025 


SAG0037 


147 


conserved hypothetical protein 


ORF00026 


SAG0038 


220 


conserved hypothetical protein 


ORF00027 


SAG0039 


305 


N-acetylneuraminate lyase, putative 


ORF00028 


SAG0040 


293 


ROK family protein 


ORF00029 


SAG0041 


325 


acetyl xylan esterase, putative 


ORF00030 


SAG0042 


267 


phosphosugar-binding transcriptional regulator, RpiR 
family, putative 


ORF00031 


SAG0043 


421 


phosphoribosylamine-glycine ligase 


ORF00032 


SAG0044 


162 


phosphoribosylaminoimidazole carboxylase, catalytic
subunit


ORF00033 


SAG0045 


363 


phosphoribosylaminoimidazole carboxylase, ATPase 
subunit 


ORF00035 


SAG0046


463 


hypothetical protein 


ORF00036 


SAG0047 


432 


adenylosuccinate lyase 


ORF00037 


SAG0048 


303 


transcriptional regulator, Cro/CI family 


ORF00038 


SAG0049 


332 


Holliday junction DNA helicase RuvB


ORF00039 


SAG0050 


145 


phosphotyrosine protein phosphatase, low molecular 
weight 


ORF00040 


SAG0051


•i Of? 


MORN motif family protein 


ORF00041




COO 


membrane protein, putative 


ORF00042 


SAG0053 


880 


aldehyde-alcohol dehydrogenase 


ORF00043 


SAG0054 


338 


alcohol dehydrogenase, propanol-preferring 


ORF00044 


SAG0055 


496 


threonine synthase 


ORF00045


SAG0056


AAO 


MATE efflux family protein


ORF00046 


SAG0057 


102 


ribosomal protein S10 


ORF00047 


SAG0058 


208 


ribosomal protein L3 


ORF00048 


SAG0059 


207 


ribosomal protein L4 


ORF00049 


SAG0060 


98 


ribosomal protein L23 


ORF00050 


SAG0061 


277 


ribosomal protein L2 


ORF00052 


SAG0062 


92 


ribosomal protein S19 


ORF00054 


SAG0063 


114 


ribosomal protein L22 


ORF00055 


SAG0064 


217 


ribosomal protein S3


ORF00056 


SAG0065


137 


ribosomal protein L16 


ORF00058 


SAG0066


68 


ribosomal protein L29


ORF00059


SAG0067


86 


ribosomal protein S17 


ORF00060


SAG0068


122 


ribosomal protein L14 


ORF00061


SAG0069 


101 


ribosomal protein L24 


ORF00063 


SAG0070 


180 


ribosomal protein L5 


ORF00064 


SAG0071 


61 


ribosomal protein S14, putative 


ORF00065 


SAG0072 


132 


ribosomal protein S8 


ORF00066 


SAG0073 


178 


ribosomal protein L6


ORF00068 


SAG0074 


118 


ribosomal protein L18 


ORF00069 


SAG0075 


164 


ribosomal protein S5 


ORF00070 


SAG0076 


59 


ribosomal protein L30 


ORF00071 


SAG0077 


146 


ribosomal protein L15 


ORF00072 


SAG0078 


434 


preprotein translocase, SecY subunit 


ORF00073 


SAG0079 


212 


adenylate kinase 


ORF00074 


SAG0080 


72 


translation initiation factor IF-1


ORF00075 


SAG0081 


38 


ribosomal protein L36 


ORF00077 


SAG0082 


121


ribosomal protein S13 


ORF00078 


SAG0083 


118 


ribosomal protein S11


ORF00080 


SAG0084 


312 


DNA-directed RNA polymerase, alpha subunit 


ORF00081 


SAG0085 


128 


ribosomal protein L17 


ORF00087 


SAG0086 


97 


hypothetical protein 


ORF00088 


SAG0087 


59 


hypothetical protein 


ORF00089 


SAG0088 


56 


hypothetical protein 


ORF00090 


SAG0089 


183 


conserved hypothetical protein 


ORF00091 


SAG0090 


139 


conserved hypothetical protein 


ORF00093 


SAG0091 


144 


transcriptional regulator ComX1, putative 


ORF00094 


SAG0092 


230 


phosphoglycerate mutase family protein 


ORF00095 


SAG0093 


250 


D-alanyl-D-alanine carboxypeptidase family protein 


ORF00096 


SAG0094 


191 


N-acetylmuramoyl-L-alanine amidase, family 4 protein 


ORF00097 


SAG0095


O A A 

o44 


heat-inducible transcription repressor HrcA


ORF00098 


SAG0096


W AA 


heat shock protein GrpE 


ORF00099 


SAG0097


buy 


dnaK protein 


ORF00100 


SAG0098 


379 


dnaJ protein 


ORF00101 


SAG0099 


415 


transcriptional regulator, GntR family 


ORF00102 


SAG0100 


258 


tRNA pseudouridine synthase A 


ORF00103 


SAG0101 


252 


phosphomethylpyrimidine kinase, putative 


ORF00104 


SAG0102


154 


conserved hypothetical protein 


ORF00105 


SAG0103 


189 


conserved hypothetical protein 


ORF00106 


SAG0104


280 


conserved hypothetical protein 


ORF00107 


SAG0105 


427 


trigger factor 


ORF00108 


SAG0106 


191 


DNA-directed RNA polymerase, delta subunit, putative 


ORF00109 


SAG0107


534 


CTP synthase 


ORF00110 


SAG0108 


308 


conserved hypothetical protein 


ORF00111 


SAG0109


148 


deoxyuridine 5'-triphosphate nucleotidohydrolase


ORF00112 


SAG0110 


454 


DNA repair protein RadA 


ORF00113 


SAG0111 


165 


carbonic anhydrase-related protein 


ORF00115 


SAG0112 


439 


pyridine nucleotide-disulphide oxidoreductase family 
protein 


ORF00116 


SAG0113 


484 


glutamyl-tRNA synthetase 


ORF00117 


SAG0114 


322 


ribose ABC transporter, periplasmic D-ribose-binding
protein


ORF00118


SAG0115


olU 


ribose ABC transporter, permease protein 


ORF00119


SAG0116


4yz 


ribose ABC transporter, ATP-binding protein 


ORF00120


SAG0117


ToV 


ribose ABC transporter protein RbsD 


ORF00121


SAG0118


ono 


ribokinase 




SAG0119


328 


ribose operon repressor RbsR 


ORF00123


SAG0120


32 


hypothetical protein 


ORF00124


SAG0121 


362 


permease, putative 


UKrUUIZO 


SAG0122


228 


ABC transporter, ATP-binding protein 


vJKrU012o 


SAG0123


223 


DNA-binding response regulator 


UKrU012o 


SAG0124


356 


sensor histidine kinase 


vJKrUUI^y 


SAG0125


396 


argininosuccinate synthase 


UKrUOloO 


SAG0126


462 


argininosuccinate lyase 


UKrUUlol 


SAG0127


293 


fructose-bisphosphate aldolase 


ORF00132 


SAG0128 


305 


L-2-hydroxyisocaproate dehydrogenase 


ORF00133 


SAG0129


62 


ribosomal protein L28 


ORF00134 


SAG0130 


121 


conserved hypothetical protein 


ORF00135 


SAG0131 


543 


DAK2 domain protein 


ORF00136 


SAG0132


294 


SPFH domain/Band 7 family protein 


ORF00137 


SAG0133 


38 


conserved hypothetical protein 


ORF00138 


SAG0134


i 96 


hypothetical protein 


ORF00141 


SAG0135


246 


amino acid ABC transporter, ATP-binding protein 


ORF00142 


SAG0136


516 


amino acid ABC transporter, amino acid-binding 
protein/permease protein 


ui\ruu i*to 


SAG0137


OZ/ 


conserved hypothetical protein 




SAG0138


y 


undecaprenol kinase, putative 


UrxrUU I*tO 


SAG0139


ZD 1 


negative regulator of competence MecA, putative 


ORF00148 


SAG0140 


386 


glycosyl transferase, group 4 family protein 


ORF00149


SAG0141


256 


ABC transporter, ATP-binding protein 


ORF00150


SAG0142


420 


conserved hypothetical protein 


ORF00151 


SAG0143


410 


selenocysteine lyase 


ORF00152 


SAG0144 


147 


NifU family protein 


ORF00153 


SAG0145 


472 


conserved hypothetical protein 


ORF00154 


SAG0146 


395 


penicillin-binding protein 4, putative 


ORF00155 


SAG0147 


411 


D-alanyl-D-alanine carboxypeptidase 


ORF00156 


SAG0148 


551 


oligopeptide ABC transporter, substrate binding protein, 
putative 


ORF00157 


SAG0149


304 


oligopeptide ABC transporter, permease protein


ORF00158 


SAG0150 


343 


oligopeptide ABC transporter, permease protein 


ORF00160 


SAG0151 


348 


oligopeptide ABC transporter, ATP-binding protein 


ORF00161 


SAG0152


310 


oligopeptide ABC transporter, ATP-binding protein 


iJKrUUI do 


SAG0153


283 


4-diphosphocytidyl-2C-methyl-D-erythritol kinase 


<JKrUUiD/ i 


SAG0154


147 


adc operon repressor AdcR 


OKrOOlDO 


SAG0155


236 


zinc ABC transporter, ATP-binding protein 


ORF00169 


SAG0156


270 


zinc ABC transporter, permease protein 


ORF00172 






tyrosyl-tRNA synthetase


ORF00173 


SAG0159 


765 


penicillin-binding protein 1B, putative 


ORF00174 


SAG0160 


1191 


DNA-directed RNA polymerase, beta subunit 


ORF00176 


SAG0161 


1216 


DNA-directed RNA polymerase beta' subunit 


ORF00178 


SAG0162 


121 


conserved hypothetical protein


ORF00179 


SAG0163 


323 


competence protein CglA


ORF00180 


SAG0164 


282 


competence protein CglB


ORF00181 


SAG0165 


151 


conserved hypothetical protein 


ORF00182 


SAG0166 


123 


conserved domain protein


L/KrUU iOo 


SAG0167




conserved hypothetical protein


ORF00184


SAG0168


^Q7 


dlrCldlc Ml Idoc 


Ur\ruu I oo 


SAG0169


68 


transcriptional regulator, Cro/CI family


UKruu i o ( 


SAG0170


4*5 


hypothetical protein


vJKrUUl Oo 


SAG0171


1*51 


hypothetical protein


ORF00189


SAG0172




protease, putative


ORF00190


SAG0173


9^R 




ORF00191


SAG0174




y tUldl I ly l~cll 1 III lUjJcpuuaou 




SAG0175


7Q 




UKrUUI so 


SAG0176


Q4. 


conserved hypothetical protein


ORF00194


SAG0177


107 


thioredoxin family protein


ORF00195




9H8 


tRNA binding domain protein


ORF00196 


SAG0179 


238 


conserved hypothetical protein 


ORF00198 


SAG0180


131 


single-strand binding protein 


ORF00199 


SAG0181


*i14 


hydrolase, haloacid dehalogenase-like family


ORF00200 


SAG0182


581 


sensor histidine kinase, putative


ORF00201 


SAG0183


246 


response regulator 


ORF00203 


SAG0184


151 


conserved hypothetical protein 


ORF00204 


SAG0185


242 


membrane protein, putative 


ORF00205 


SAG0186


36 


hypothetical protein 


ORF00206 


SAG0187


542 


oligopeptide ABC transporter, oligopeptide-binding
protein


ORF00207


SAG0188


OZ.O 


oligopeptide ABC transporter, permease protein


ORF00208


SAG0189


97^ 


oligopeptide ABC transporter, permease protein


f\Dirnnono 


SAG0190


9R7 


peptide ABC transporter, ATP-binding protein




SAG0191


Z.UO 


peptide ABC transporter, ATP-binding protein




SAG0192


R7R 
Of D 


PTS system, IIABC components


vJKrUU^ 1^: 




OH- 1 


alpha amylase family protein


/"*\Dirnn'"M /i 
VJKrUUZ14 


SAG0194


R^Q 
ooy 


transcriptional antiterminator, BglG family


r\DCnno-i c? 
UKrUUZ id 


SAG0195


^77 

Oil 


IS1548, transposase


/-^Dcnn*^ *7 
UKrUUZl / 


SAG0196


RR 


conserved domain protein


UKrUuzIo 


SAG0197


Q4 


PTS system, IIB component, putative


OKrUlLiiy 


SAG0198


4^1 


PTS system, IIC component, putative




SAG0199




transketolase, N-terminal subunit




SAG0200




transketolase, C-terminal subunit




SAG0201


41Q 


oxidoreductase, putative




SAG0202


89 


ribosomal protein S16




SAG0203


70 Q 


polyribonucleotide nucleotidyltransferase


vJKrUUZ^O 


SAG0204


9^n 


conserved hypothetical protein




SAG0205


194 


serine O-acetyltransferase


vJKruU^^o 


SAG0206


Rfi 


hypothetical protein


HDCnAOOQ 

LJKrUU^zy 


SAG0207


447 


cysteinyl-tRNA synthetase


\JKrUU*ioU 


SAG0208


19« 


conserved hypothetical protein


ORF00231 


SAG0209 


251 


RNA methyltransferase, TrmH family, group 3 


ORF00232 


SAG0210 


172 


conserved hypothetical protein 


ORF00233 


SAG0211


286 


DegV family protein 


ORF00234




*xo 

Oc. 


hypothetical protein


ORF00235 


SAG0213 


39 


hypothetical protein 


ORF00236 


SAG0214 


148 


ribosomal protein L13 


ORF00237 


SAG0215 


130 


ribosomal protein S9 


ORF00238 


SAG0216 


33 


hypothetical protein 


ORF00239 


SAG0217 


384 


site-specific recombinase, phage integrase family 


ORF00240 


SAG0218 


158 


transcriptional regulator, Cro/CI family 


ORF00241 


SAG0219 


101 


hypothetical protein


ORF00242 


SAG0220 


92 


conserved hypothetical protein 


ORF00243 


SAG0221 


76 


hypothetical protein 


ORF00244 


SAG0222 


108 


conserved domain protein 


ORF00245 


SAG0223 


209 


conserved hypothetical protein, fusion 


ORF00246 


SAG0224 


332 


replication initiation protein, putative 


ORF00247 


SAG0225 


144 


hypothetical protein 


ORF00248 


SAG0226 


418 


recombination protein 


ORF00249 


SAG0227 


156 


hypothetical protein 


ORF00250 


SAG0228 


111 


conserved hypothetical protein 


ORF00251 


SAG0229 


95


conserved hypothetical protein 


ORF00252 


SAG0230 


96 


conserved hypothetical protein 


ORF00253 


SAG0231 


135 


hypothetical protein 


ORF00254 


SAG0232 


186 


hypothetical protein 


ORF00255 


SAG0233 


226 


hypothetical protein 


ORF00256 


SAG0234 


128 


hypothetical protein 


ORF00257 


SAG0235 


93 


hypothetical protein 


ORF00258 


SAG0236 


32 


hypothetical protein 


ORF00259 


SAG0237 


34 


hypothetical protein 


ORF00260 


SAG0238 


41 


hypothetical protein 


ORF00261 


SAG0239 


286 


transcriptional regulator, MutR family


ORF00262 


SAG0240 


393 


transporter, putative 


ORF00263 


SAG0241 


213 


amino acid ABC transporter, permease protein 


ORF00264 


SAG0242 


308 


amino acid ABC transporter, amino acid-binding 
protein 


ORF00265 


SAG0243 


211 


amino acid ABC transporter, permease protein 


ORF00266 


SAG0244 


381 


amino acid ABC transporter, ATP-binding protein 


ORF00272 


SAG0245 


152 


hypothetical protein 


ORF00273 


SAG0246 


268 


hypothetical protein 


ORF00274 


SAG0247 


116 


hypothetical protein 


ORF00275 


SAG0248 


90 


hypothetical protein 


ORF00276 


SAG0249 


116 


hypothetical protein 


ORF00278 


SAG0250 


193 


hypothetical protein 


ORF00279 


SAG0251 


72 


transcriptional regulator, Cro/CI family 


ORF00280 


SAG0252 


186 


acetyltransferase, GNAT family 


ORF00281 


SAG0253 


192 


acetyltransferase, GNAT family 


ORF00282 


SAG0254 


226 


acetyltransferase, GNAT family


ORF00283 


SAG0255 


315 


conserved hypothetical protein 


ORF00284 


SAG0256 


163 


RNA polymerase sigma factor, ECF subfamily 


ORF00285 


SAG0257 


53 


hypothetical protein 


ORF00287 


SAG0258 


202 


transcriptional regulator, TetR family 


ORF00288 


SAG0259 


365 


ABC transporter efflux protein, DrrB family, putative 


ORF00289 


SAG0260 


238 


ABC transporter, ATP-binding protein 


ORF00290 


SAG0261 


129 


IS1381, transposase OrfB 


ORF00291 


SAG0262 


127 


IS1381, transposase OrfA 


ORF00292 


SAG0263 


171 


hypothetical protein 


ORF00293 


SAG0264 


103 


conserved hypothetical protein 


ORF00294 


SAG0265 


235 


conserved hypothetical protein 


ORF00295 


SAG0266 


382 


N-acetylglucosamine-6-phosphate deacetylase


ORF00296 


SAG0267 


180 


conserved hypothetical protein 


ORF00297 


SAG0268 


304 


glycyl-tRNA synthetase, alpha subunit 


ORF00298 


SAG0269 


213 


acyl carrier protein phosphodiesterase, putative


ORF00299 


SAG0270 


679 


glycyl-tRNA synthetase, beta subunit 


ORF00300 


SAG0271 


85 


conserved hypothetical protein 


ORF00301 


SAG0272 


87 


membrane protein, putative


ORF00302




cno 
DUZ 


glycerol kinase 


ORF00303


SAG0274


ouy 


alpha-glycerophosphate oxidase 


ORF00304


SAG0275


ZoZ 


glycerol uptake facilitator protein 


ORF00305




440 


NADH oxidase, putative 


ORF00306


SAG0277


4/o 


conserved hypothetical protein 


ORF00307




661 


transketolase 






101 


conserved hypothetical protein 


ORF00309 


SAG0280 


244 


ABC transporter, ATP-binding protein 


ORF00310 


SAG0281 


534 


membrane protein, putative 


ORF00313 


SAG0282 


461 


PTS system, IIBC components


ORF00314 


SAG0283 


267 


glutamate 5-kinase 


ORF00315 


SAG0284 


417 


gamma-glutamyl phosphate reductase 


ORF00316 


SAG0285 


298 


conserved hypothetical protein TIGR00006 


ORF00317 


SAG0286 


108 


cell division protein FtsL, putative 


ORF00318 


SAG0287 


752 


penicillin-binding protein 2X 


ORF00319 


SAG0288 


336 


phospho-N-acetylmuramoyl-pentapeptide-transferase


ORF00320 


SAG0289 


447 


ATP-dependent RNA helicase, DEAD/DEAH box 
family 


ORF00321


SAG0290




ABC transporter, substrate-binding protein 


ORF00322




OCT 


amino acid ABC transporter, permease protein 




SAG0292


OAT 


amino acid ABC transporter, ATP-binding protein 


ORF00324


SAG0293


7 A 
f 4 


conserved hypothetical protein 


ORF00325


SAG0294


•3U4 


thioredoxin reductase 


ORF00326




Add 
400 


conserved hypothetical protein 


ORF00327


SAG0296




NAD synthetase 






AAA 
444 


aminopeptidase C 




SAG0298


/ oo 


penicillin-binding protein 1A 


ORF00330


SAG0299


iyy 


recombination protein U 


ORF00331 


SAG0300 


172 


conserved hypothetical protein


ORF00332


SAG0301 


40 


hypothetical protein 


ORF00333 


SAG0302 


110 


conserved hypothetical protein 


ORF00335 


SAG0303 


384 


conserved hypothetical protein 


ORF00336 


SAG0304 


487 


conserved hypothetical protein 


ORF00337 


SAG0305 


160 


autoinducer-2 production protein LuxS 


ORF00338 


SAG0306 


535 


KH domain protein 


ORF00340 


SAG0307 


33 


hypothetical protein 


ORF00341 


SAG0308 




ABC transporter, ATP-binding protein, FRAMESHIFT 


ORF00343 


SAG0309 


246


ABC transporter, permease protein, putative 


ORF00344 


SAG0310 


361 


conserved hypothetical protein 


ORF00345 


SAG0311




DNA-binding response regulator POINT MUTATION 


ORF00347 


SAG0312 


234 


conserved hypothetical protein 


ORF00348 


SAG0313 


209 


guanylate kinase 


ORF00349 


SAG0314 


104 


DNA-directed RNA polymerase, omega subunit, 
putative 


ORF00350 


SAG0315 


796 


primosomal protein N'


ORF00351 


SAG0316 


311 


methionyl-tRNA formyltransferase 


ORF00352 


SAG0317 


440 


Sun protein 


ORF00353 


SAG0318 


245 


serine/threonine phosphatase, putative 


ORF00354 


SAG0319 


651 


serine/threonine protein kinase 


ORF00355 


SAG0320 


231 


conserved hypothetical protein 


ORF00356 


SAG0321 


339 


sensor histidine kinase, putative 


ORF00358 


SAG0322 


213 


DNA-binding response regulator


ORF00359 


SAG0323 


466 


hydrolase, haloacid dehalogenase family/peptidyl- 
prolyl cis-trans isomerase, cyclophilin type 


urxruuoou 




A OA 


general stress protein, putative 


wrxruuoo 1 


SAG0325




pyruvate formate-lyase-activating enzyme 




SAG0326




transcriptional regulator, DeoR family 


VJRPUUOOo 


SAG0327


007 


transcriptional regulator, putative 


Ur\rUUo04 


SAG0328


i ri7 

TU/ 


PTS system, cellobiose-specific IIA component 


nocrnniftft 
UKrUUooo 


SAG0329


1 Ub 


PTS system, cellobiose-specific IIB component 


(JKrUUob/ 


SAG0330


433 


PTS system, cellobiose-specific IIC component


LJKrUUobo 


SAG0331


Q*f O 

olo 


formate acetyltransferase


\JKrUUoby 


SAG0332




transaldolase family protein 


UKrUUo/1 


SAG0333


ob2 


glycerol dehydrogenase 


ORF00372


SAG0334 


308 


cysteine synthase A 


vJKrUUo/o 


SAG0335


Z14 


conserved hypothetical protein TIGR00257 


UKrUUo / 


SAG0336


4zy 


helicase, putative 


UKrUUo 


SAG0337




competence protein F, putative 


UKrUUo ( o 


SAG0338


•i QA 

1o4 


ribosomal subunit interface protein 




SAG0339


Add 
4oU 


aspartate kinase family protein 


UKrUUooo 


SAG0340


Zlo 


hydrolase, haloacid dehalogenase-like family


Ur\ruUoo4 


SAG0341


4y 


hypothetical protein 


ORF00385 


SAG0342 


263 


enoyl-CoA hydratase/isomerase family protein 


ORF00386 


SAG0343 


144 


transcriptional regulator, MarR family 


ORF00387 


SAG0344 


323 


3-oxoacyl-(acyl-carrier-protein) synthase III 


ORF00388 


SAG0345 


74 


acyl carrier protein 


ORF00390 


SAG0346 


319 


enoyl-(acyl-carrier-protein) reductase II 


ORF00391 


SAG0347 


308 


malonyl CoA-acyl carrier protein transacylase 


ORF00392


SAG0348 


244 


3-oxoacyl-[acyl-carrier protein] reductase 


ORF00393 


SAG0349 


410 


3-oxoacyl-(acyl-carrier-protein) synthase II


ORF00394


SAG0350 


166 


acetyl-CoA carboxylase, biotin carboxyl carrier protein 


ORF00395 


SAG0351 


140 


(3R)-hydroxymyristoyl-(acyl-carrier-protein) 
dehydratase 


ORF00396 


SAG0352 


456 


acetyl-CoA carboxylase, biotin carboxylase 


ORF00397 


SAG0353


291 


acetyl-CoA carboxylase, carboxyl transferase, beta 
subunit 


ORF00398


SAG0354 


257 


acetyl-CoA carboxylase, carboxyl transferase, alpha
subunit


ORF00399


SAG0355


Z 1 U 


conserved hypothetical protein 




SAG0356




seryl-tRNA synthetase 






oou 


hypothetical protein


ORF00403 


SAG0358 


120 


conserved hypothetical protein 


ORF00404 


SAG0359 


303 


PTS system, mannose-specific IID component 


ORF00405


SAG0360 


270 


PTS system, mannose-specific IIC component


ORF00406


SAG0361 


336 


PTS system, mannose-specific IIAB components 


ORF00407 


SAG0362 


270 


hydrolase, haloacid dehalogenase-like family


ORF00408 


SAG0363 


194 


hypothetical protein 


ORF00409 


SAG0364 


203 


membrane protein, putative


ORF00410


SAG0365


473 


xanthine/uracil permease family protein 


ORF00411


SAG0366 


169 


conserved hypothetical protein TIGR00150, putative 


ORF00412 


SAG0367 


186 


acetyltransferase, GNAT family 


ORF00413 


SAG0368 


435 


transcriptional regulator, putative 


ORF00414 


SAG0369 


98 


conserved hypothetical protein


ORF00415 


SAG0370 


139 


HIT family protein 


ORF00416 


SAG0371 


167 


hypothetical protein 


ORF00417 


SAG0372 


85 


hypothetical protein
ORF Ref No.


SAGxxxx Ref No. 


aa 
ORF00419


SAG0373 


241 


ABC transporter, ATP-binding protein 


ORF00421 


SAG0374 


344 


ABC transporter, permease protein 


ORF00422 


SAG0375 


266 


conserved hypothetical protein 


ORF00423 


SAG0376 


211 


conserved hypothetical protein TIGR00091


ORF00424 


SAG0377 


127 


conserved hypothetical protein, POINT MUTATION 


ORF00425 


SAG0378 


379 


N utilization substance protein A 


ORF00426 


SAG0379 


98 


conserved hypothetical protein 


ORF00427 


SAG0380 


100 


ribosomal protein L7A family 


ORF00428 


SAG0381 


927 


translation initiation factor IF-2 


ORF00429 


SAG0382 


122 


ribosome-binding factor A 


ORF00430 


SAG0383 


334 


conserved hypothetical protein 


ORF00431 


SAG0384 


138 


transcriptional repressor CopY 


ORF00432 


SAG0385 


744 


copper-transporter ATPase CopA 


ORF00433 


SAG0386 


68 


copper-transporter protein CopZ 


ORF00434 


SAG0387 


204 


conserved hypothetical protein 


ORF00435 


SAG0388 


270 


hydrolase, haloacid dehalogenase-like family 


ORF00436 


SAG0389 


880 


DNA polymerase I 


ORF00437 


SAG0390 


146 


CoA binding domain protein 


ORF00438 


SAG0391 


159 


transcriptional regulator, Fur family 


ORF00439 


SAG0392 


521 


cell wall surface anchor family protein 


ORF00440 


SAG0393 


228 


DNA-binding response regulator 



ORF00441 


SAG0394 


345 


sensor histidine kinase 


ORF00442 


SAG0395 


246 


conserved hypothetical protein 


ORF00443 


SAG0396 


380 


queuine tRNA-ribosyltransferase 


ORF00444 


SAG0397 


102 


conserved hypothetical protein 


ORF00445 


SAG0398 


179 


bioY family protein 


ORF00446 


SAG0399 


258 


AtsA/ElaC family protein 


ORF00447 


SAG0400 


168 


cytidine/deoxycytidylate deaminase family protein 


ORF00448 


SAG0401 


44 


hypothetical protein 


ORF00449 


SAG0402 


449 


glucose-6-phosphate isomerase 


ORF00450 


SAG0403 


175 


5-formyltetrahydrofolate cyclo-ligase family protein 


ORF00451 


SAG0404 


225 


rhomboid family protein 


ORF00452 


SAG0405 


347 


lipoprotein 


ORF00453 


SAG0406 


299 


UTP-glucose-1-phosphate uridylyltransferase


ORF00454 


SAG0407


338 


glycerol-3-phosphate dehydrogenase (NAD(P)+)


ORF00455 



SAG0408 


109 


ribonuclease P protein component 


ORF00456 


SAG0409 


271 


SpoIIIJ family protein


ORF00458 


SAG0410 


273 


R3H domain protein 


ORF00463 


SAG0411 


177 


conserved hypothetical protein 


ORF00464 


SAG0412 


258 


RecX protein 


ORF00465 


SAG0413 


451 


RNA methyltransferase, TrmA family 


ORF00466 


SAG0414 


153 


conserved hypothetical protein 


ORF00467 


SAG0415 


142 


acetyltransferase, GNAT family 


ORF00468 


SAG0416 


1233


protease, putative 


ORF00469


SAG0417 


302


glycosyl transferase, group 2 family protein 


ORF00470 


SAG0418 


336 


ribonucleoside-diphosphate reductase 2, beta subunit 


ORF00471 


SAG0419 


137 


nrdI protein


ORF00472 


SAG0420 


721 


ribonucleoside-diphosphate reductase 2, alpha subunit 


ORF00473 


SAG0421 


1055


conserved hypothetical protein 


ORF00474 


SAG0422 


129 


conserved hypothetical protein


ORF00475 


SAG0423 


132 


conserved domain protein 
ORF00476


SAG0424 


94 


hypothetical protein 


ORF00478 


SAG0425 


105 


carboxymuconolactone decarboxylase family protein 


ORF00479 


SAG0426 


131 


conserved hypothetical protein 


ORF00480 


SAG0427 


129 


transcriptional regulator, MerR family 


ORF00482


SAG0428 


345 


alcohol dehydrogenase, zinc-containing


ORF00483 


SAG0429 


284 


oxidoreductase, aldo/keto reductase family 


ORF00484


SAG0430 


287 


cation efflux system protein 


ORF00485 


SAG0431 


174 


transcriptional regulator, TetR family 


ORF00486 


SAG0432 


397 


transcriptional regulator, AraC family 


ORF00487 


SAG0433 


1389 


surface protein Rib 


ORF00488 


SAG0434 


61 


transposase, IS256 family, truncation 


ORF00489 


SAG0435 


97 


DNA-damage-inducible protein J, putative 


ORF00490 


SAG0436 


62 


hypothetical protein 


ORF00491 


SAG0437 


123 


hypothetical protein 


ORF00493 


SAG0438 


145 


bacteriophage L54a, integrase, truncation 


ORF00495 


SAG0439 




conserved hypothetical protein, FRAMESHIFT 


ORF00496 


SAG0440 


84 


conserved hypothetical protein 


ORF00497 


SAG0441 


103 


conserved domain protein 


ORF00499 


SAG0442 


189 


acetyltransferase, GNAT family 


ORF00500 


SAG0443 


194 


acetyltransferase, GNAT family 


ORF00501 


SAG0444 


188 


conserved hypothetical protein 


ORF00502 


SAG0445 


883 


valyl-tRNA synthetase


ORF00503 


SAG0446 


319 


oxidoreductase, Gfo/Idh/MocA family


ORF00504


SAG0447 


287 


magnesium transporter, CorA family 


ORF00506 


SAG0448 


391 


transposase, IS256 family 


ORF00507 


SAG0449 


354 


conserved hypothetical protein 


ORF00508 


SAG0450 


330 


aspartate--ammonia ligase


ORF00510 


SAG0451 


149 


bacteriocin transport accessory protein, putative


ORF00511 


SAG0452 


179 


type II DNA modification methyltransferase, putative 


ORF00512 


SAG0453 


96 


hypothetical protein 


ORF00513 


SAG0454 


161 


phosphopantetheine adenylyltransferase 


ORF00515 


SAG0455 


357 


conserved hypothetical protein 


ORF00518 


SAG0456 




conserved hypothetical protein, degenerate 


ORF00519 


SAG0457 


192 


conserved hypothetical protein 


ORF00520 


SAG0458 


368 


conserved hypothetical protein TIGR00048 


ORF00521


SAG0459 


171 


VanZF domain protein 


ORF00522 


SAG0460 


581 


ABC transporter, ATP-binding/permease protein 


ORF00523 


SAG0461 


579 


ABC transporter, ATP-binding/permease protein 


ORF00524 


SAG0462 


188 


anthranilate synthase component II


ORF00525 


SAG0463 


179 


bioY family protein 


ORF00526 


SAG0464 


330 


biotin synthetase 


ORF00527 


SAG0465 


164 


hypothetical protein 


ORF00528 


SAG0466 


371 


thiolase 


ORF00531 


SAG0467 


409 


AMP-binding enzyme domain protein 


ORF00532 


SAG0468 


210 


endonuclease III


ORF00533 


SAG0469 


131 


type IV prepilin peptidase-related protein


ORF00534


SAG0470 


69 


conserved hypothetical protein 


ORF00535 


SAG0471 


322 


glucokinase 


ORF00536 


SAG0472 


126 


rhodanese domain protein 


ORF00537 


SAG0473 


613 


elongation factor Tu family protein 


ORF00538 


SAG0474 


81 


conserved hypothetical protein 


ORF00540 


SAG0475 


451 


UDP-N-acetylmuramoylalanine-D-glutamate ligase 






WO 2004/018646 PCT/US2003/026827 

Table 32: Conversion of ORF Ref Nos. with SAG Ref Nos. 



ORF Ref No. 


SAGxxxx Ref No. 


aa 


Annotation 


ORF00541 


SAG0476 


358 


UDP-N-acetylglucosamine-N-acetylmuramyl-
(pentapeptide) pyrophosphoryl-undecaprenol N-
acetylglucosamine transferase


ORF00542 


SAG0477 


378 


cell division protein DivIB, putative




SAG0478 


429 


cell division protein FtsA




SAG0479


426 


cell division protein FtsZ


ORF00546 


SAG0480 


224 


ylmE protein, putative 


ORF00547


SAG0481




ylmF protein 


ORF00548


SAG0482


O A 

o4 


YGGT family protein 


ORF00549


SAG0483


262 


ylmH protein 


ORF00550


SAG0484


256 


cell division protein DivIVA, putative


ORF00552


SAG0485


930 


isoleucyl-tRNA synthetase 


ORF00553 


SAG0486 


100 


conserved hypothetical protein 


ORF00554 


SAG0487 


151 


MutT/nudix family protein 


ORF00555 


SAG0488 


753 


ATP-dependent Clp protease, ATP-binding subunit 


ORF00556 


SAG0489 


34 


hypothetical protein 


ORF00557 


SAG0490 


76 


conserved hypothetical protein 


ORF00558 


SAG0491 


230 


amino acid ABC transporter, permease protein 


ORF00559 


SAG0492 


244 


amino acid ABC transporter, ATP-binding protein 


ORF00560 


SAG0493 


564 


phosphoglucomutase/phosphomannomutase family 
protein 


ORF00562 


SAG0494 


284 


methylenetetrahydrofolate
dehydrogenase/methenyltetrahydrofolate
cyclohydrolase


ur\ruu3oo 




97P 


conserved hypothetical protein








exodeoxyribonuclease VII, large subunit


UfxrUuOuJ 


SAG0497


71 


exodeoxyribonuclease VII, small subunit


ORF00566


SAG0498


2Q0 


geranyltranstransferase, putative




SAG0499


£- i O 


hemolysin A


ORF00568




1*57 


arginine repressor ArgR, putative








DNA repair protein RecN


ORF00571 




P7R 


DegV family protein






279 


lipase/acylhydrolase, putative




SAG0504




conserved hypothetical protein




SAG0505


Q1 


DNA-binding protein HU




SAG0506


65 


hypothetical protein






310 


dihydroorotate dehydrogenase A


ORF00577


SAG0508


411


beta-lactam resistance factor


ORF00578


SAG0509


403 


beta-lactam resistance factor


ORF00579






murM protein, putative


ORF00580


SAG0511 


270 


hydrolase, haloacid dehalogenase-like family 


ORF00581


SAG0512


4oo 


HD domain protein 


ORF00582 


SAG0513 


128 


conserved hypothetical protein 


ORF00583 


SAG0514 


894 


cation-transporting ATPase, E1-E2 family 


ORF00584 


SAG0515 


286 


conserved hypothetical protein 


ORF00585 


SAG0516 


643 


fructose-1,6-bisphosphatase, putative


ORF00586 


SAG0517 


374 


iron-sulfur cluster-binding protein, putative 


ORF00587 


SAG0518 




peptide chain release factor 2, FRAMESHIFT 


ORF00588 


SAG0519 


230 


cell division ABC transporter, ATP-binding protein FtsE 


ORF00589 


SAG0520 


309 


cell division ABC transporter, permease protein FtsX 


ORF00590 


SAG0521 


236 


carboxymethylenebutenolidase-related protein 


ORF00591 


SAG0522 


232 


metallo-beta-lactamase superfamily protein 
ORF00592


SAG0523 


254 


oxidoreductase, short chain dehydrogenase/reductase 
family


ORF00593 


SAG0524 


835 


DNA polymerase III, epsilon subunit/ATP-dependent
helicase DinG


t-Jrxruuoyo 


SAG0525




aspartate aminotransferase 


Ur\rUUOyD 




44 o 


asparaginyl-tRNA synthetase 


ORF00597 


SAG0527 


185 


conserved hypothetical protein 


ORF00598 


SAG0528 


327 


inosine-uridine preferring nucleoside hydrolase 


ORF00599 


SAG0529 


38 


hypothetical protein 


ORF00600 


SAG0530 


137 


OsmC/Ohr family protein 


ORF00601 


SAG0531 


296 


conserved hypothetical protein 


ORF00602 


SAG0532 


324 


conserved hypothetical protein 


ORF00603 


SAG0533 


303 


Uncharacterized BCR, COG1481 


ORF00604 


SAG0534 


465 


dipeptidase 


ORF00605 


SAG0535 


506 


zinc ABC transporter, zinc-binding adhesion lipoprotein


ORF00606




86 


ribosomal protein L31 


ORF00607 


SAG0537 


311 


DHH family protein 


ORF00608 


SAG0538 


340 


adenosine deaminase, putative 


ORF00609 


SAG0539 


147 


flavodoxin 


ORF00610 


SAG0540 


91 


chorismate mutase, putative 


ORF00611


SAG0541 


398 


voltage-gated chloride channel family protein 


ORF00612


SAG0542 


127 


IS1381, transposase OrfA 


ORF00613 


SAG0543 


129 


IS1381, transposase OrfB 


ORF00614 


SAG0544 


115 


ribosomal protein L19 


ORF00615 


SAG0545 


359 


site-specific recombinase, phage integrase family 


ORF00617


SAG0546


R7 
Of 



conserved domain protein 


ORF00618




loo 


hypothetical protein


UrxrUUO \ \J 


SAG0548


orr 
zoo 


repressor protein, putative 


ORF00620


SAG0549


4 f 


hypothetical protein 




SAG0550


7A 
f 4 


conserved hypothetical protein 


ORFnnK9C> 




CO 

OZ 


conserved hypothetical protein 


UrxrUUOtO 




R9 
OZ 


hypothetical protein 


ORF00624




ZOO 


hypothetical protein 




SAG0554


DO 


transcriptional regulator, Cro/CI family 


ORF00627


SAG0555


Z4y 


antirepressor, putative 




SAG0556


A "7 
4/ 


hypothetical protein 


ORFnnR^n 


SAG0557


I O 


hypothetical protein 


HRPflflft^O 


SAG0558


"7 A 
/4 


hypothetical protein 


ORF00633


SAG0559


Zoo 


conserved hypothetical protein 




SAG0560


77 


conserved hypothetical protein 


ORF00635


SAG0561


4b 


hypothetical protein 


ORF00636


SAG0562


o4 


hypothetical protein 


ORF00637 


SAG0563 


53 


hypothetical protein 


ORF00638 


SAG0564 


160 


conserved hypothetical protein 






ZZ*+ 


conserved domain protein 


ORF00640 


SAG0566 


138 


single-strand binding protein 


ORF00641 


SAG0567 


439 


reverse transcriptase/maturase family protein 


ORF00642 


SAG0568 


67 


conserved hypothetical protein 


ORF00643 


SAG0569 


158 


conserved hypothetical protein 


ORF00644 


SAG0570 


115 


hypothetical protein 


ORF00645 


SAG0571 


43 


hypothetical protein 


ORF00646 


SAG0572 


138


conserved hypothetical protein 


ORF00647 


SAG0573 


54


hypothetical protein 
ORF00648


SAG0574


89 


conserved hypothetical protein 


ORF00649 


SAG0575 


110 


hypothetical protein 


ORF00650 


SAG0576


43 


hypothetical protein 


ORF00652 


SAG0577 


177 


conserved hypothetical protein 


ORF00653 


SAG0578 


88 


conserved hypothetical protein 


ORF00654 


SAG0581 


118 


conserved hypothetical protein 


ORF00655 


SAG0582 


422 


conserved hypothetical protein 


ORF00656 


SAG0583 


406 


conserved hypothetical protein 


ORF00657 


SAG0584 


62 


conserved hypothetical protein, truncation 


ORF00658 


SAG0585 


471 


conserved hypothetical protein 


ORF00659 


SAG0586 


154 


conserved hypothetical protein 


ORF00660 


SAG0587 


300 


structural protein, putative 


ORF00661 


SAG0588 


71 


conserved hypothetical protein 


ORF00662 


SAG0589 


143 


conserved hypothetical protein 


ORF00663 


SAG0590 


112 


conserved hypothetical protein


ORF00664 


SAG0591 


78 


conserved hypothetical protein 


ORF00665 


SAG0592


111 


conserved hypothetical protein 


ORF00666 


SAG0593


185 


structural protein 


ORF00667 


SAG0594 


81 


conserved hypothetical protein 


ORF00668


SAG0595 


123 


conserved hypothetical protein 


ORF00669


SAG0596 


670 


PblA, internal deletion 


ORF00670




506 


minor structural protein, putative 


ORF00671


SAG0598 


1374 


minor structural protein, putative 


ORF00672


SAG0599


668 


minor structural protein, putative 


ORF00673 


SAG0600 


109 


hypothetical protein 


ORF00674 


SAG0601 


70 


hypothetical protein 


ORF00675 


SAG0602 


100 


conserved hypothetical protein 


ORF00676 


SAG0603 


111 


holin, putative 


ORF00677 


SAG0604 


239 


lysin, putative 


ORF00678 


SAG0605 


323 


conserved hypothetical protein 


ORF00679 


SAG0606 


66 


conserved hypothetical protein 


ORF00681 


SAG0607 


56 


conserved hypothetical protein 


ORF00682 


SAG0608 


59 


hypothetical protein 


ORF00683 


SAG0609 


193 


site-specific recombinase, phage integrase family 


ORF00685 


SAG0610 


134 


conserved hypothetical protein 


ORF00687 


SAG0611 




transposase, degenerate FRAMESHIFT 


ORF00689 


SAG0612 


53 


conserved hypothetical protein, FRAMESHIFT 


ORF00690 


SAG0613 


425 


transmembrane protein Vexp1


ORF00691 


SAG0614 


218 


ABC transporter, ATP-binding protein Vexp2 


ORF00692 


SAG0615 


458 


transmembrane protein Vexp3 


ORF00693 


SAG0616 


217 


DNA-binding response regulator VncR 


ORF00694 


SAG0617 


439 


sensor histidine kinase VncS


ORF00695 


SAG0618 


195 


transposase OrfB, IS3 family, truncation


ORF00697 


SAG0619 


66 


conserved hypothetical protein 


ORF00698 


SAG0620 


62 


hypothetical protein 


ORF00699


SAG0621 


401 


rod shape-determining protein RodA, putative


ORF00700 


SAG0622 


186 


hydrolase, haloacid dehalogenase-like family 


ORF00701 


SAG0623 


650 


DNA gyrase, B subunit 


ORF00702 


SAG0624 


574 


septation ring formation regulator EzrA, putative 


ORF00703 


SAG0625 


213 


phosphoserine phosphatase SerB 


ORF00704 


SAG0626 


161 


MutT/nudix family protein 


ORF00705 


SAG0627


151 


conserved hypothetical protein 


ORF00706 


SAG0628 


435 


enolase 


ORF00707 


SAG0629 


354 


conserved domain protein 
ORF00708


SAG0630


427 


3-phosphoshikimate 1-carboxyvinyltransferase


ORF00709


SAG0631


170


shikimate kinase


ORF00710


SAG0632


457 


nor nrotpin 


ORF00711


SAG0633 



451 


RNA methyltransferase, TrmA family


ORF00712 


SAG0634 


70 


hypothetical protein 


ORF00713




OA Pi 
<c4o 


acid phosphatase precursor, class B


ORF00714


SAG0636


•\ 70 


conserved hypothetical protein


ORF00717 


SAG0637




transcriptional regulator, TetR family, putative,
FRAMESHIFT


ORF00718






cell wall surface anchor family protein


ORF00720




£ f O 


transposase OrfB, IS3 family


ORF00721


SAG0640




transposase OrfA, IS3 family


ORF00722


SAG0641




Tn5252, Orf 10 protein, degenerate POINT MUTATION


ORF00723 


SAG0642


59 


hypothetical protein 


ORF00725 


SAG0643 





chaperonin, 33 kDa DEGENERATE 


ORF00726 



SAG0644 


402 


transcriptional regulator, AraC family 


ORF00727 


SAG0645 


554 


cell wall surface anchor family protein, putative 


ORF00728 


SAG0646 


307 


cell wall surface anchor family protein


ORF00729 


SAG0647 


305 


sortase family protein 


ORF00731 



SAG0648 


260 


sortase family protein 


ORF00732 


SAG0649 


890 


cell wall surface anchor family protein, putative


ORF00734 


SAG0650 


189 


sortase family protein, FRAMESHIFT 


ORF00735 


SAG0651 



201 


hypothetical protein 


ORF00737


SAG0653 


76 


conserved hypothetical protein, DEGENERATE 


ORF00738 


SAG0654 


34 


hypothetical protein 


ORF00740 



SAG0656 


36 


hypothetical protein 


ORF00741 



SAG0657 


89 


hypothetical protein 


ORF00742 


SAG0658 


383 


lipoprotein, putative 


ORF00743 



SAG0659 


330 


ABC transporter, ATP-binding protein 


ORF00744 


SAG0660 


272 


membrane protein


ORF00745 


SAG0661 


261 


conserved hypothetical protein 


ORF00747 



SAG0663 


282 


cylD protein 


ORF00748 



SAG0664 


240 


cylG protein 


ORF00749 



SAG0665 


101 


acyl carrier protein AcpC


ORF00750 


SAG0666 


158 


cylZ protein FRAMESHIFT 


ORF00751 


SAG0667 


309 


cylA protein 


ORF00752 



SAG0668 


292 


cylB protein 


ORF00753 


SAG0669 


667 


cylE protein


ORF00754 


SAG0670 


317 


cylF protein 


ORF00755 


SAG0671 


731 


cylI protein


ORF00756 


SAG0672 


403 


cylJ protein 


ORF00757 



SAG0673 


191 


cylK protein 


ORF00758


SAG0674 


113 


hypothetical protein 


ORF00759 


SAG0675 


171 


surface protein antigen-related protein 


ORF00760 


SAG0676 


ooc 
OOO 


serine protease, subtilase family, putative 


ORF00761


SAG0677


1062 


hypothetical protein


ORF00762 


SAG0678 




endopeptidase O DEGENERATE 


ORF00766 


SAG0679 


286 


hydrolase, alpha/beta fold family, putative 


ORF00767 


SAG0680 


339 


hypothetical protein 


ORF00768 


SAG0681 


353 


conserved domain protein 


ORF00769 


SAG0682 


409 


permease, putative 


ORF00770 


SAG0683 




transmembrane protein Vexp3, putative FRAMESHIFT 


ORF00774 


SAG0684 


223 


ABC transporter, ATP-binding protein 
ORF00775


SAG0685


472


conserved hypothetical protein 


ORF00776




2b1 


DNA-entry nuclease, putative 


ORF00777


SAG0687


212 


DedA family protein, putative 


ORF00778 


SAG0688 


218 


ABC transporter, ATP-binding protein 


ORF00779 


SAG0689 


257 


membrane protein, putative 


ORF00780


SAG0690 


272 


conserved hypothetical protein 


ORF00781


SAG0691 


294 


transcriptional regulator, LysR family


ORF00783 


SAG0692 


193 


regulatory protein, putative 


ORF00785


SAG0693 


377 


IS1548, transposase 


ORF00786


SAG0694 


173 


regulatory protein, putative, truncation 


ORF00787 


SAG0695 


330 


D-lactate dehydrogenase 


ORF00788 


SAG0696 


516 


sodium:galactoside symporter family protein, putative


ORF00789 


SAG0697 


341 


2-keto-3-deoxygluconate kinase 


ORF00790 


SAG0698 


599 


beta-glucuronidase 


ORF00791 


SAG0699 


223 


transcriptional regulator, GntR family 


ORF00792


SAG0700 


205 


2-dehydro-3-deoxyphosphogluconate aldolase/4- 
hydroxy-2-oxoglutarate aldolase 


ORF00793


SAG0701


466 


Glucuronate isomerase 


ORF00794 


SAG0702 


348 


mannonate dehydratase 


ORF00795


SAG0703


07A 


D-mannonate oxidoreductase 


ORF00796


SAG0704


270 


hydrolase, haloacid dehalogenase-like family


ORF00797 


SAG0705


596


glycosyl hydrolase, family 3 


ORF00798


SAG0706


obi 


proline dipeptidase 


ORF00799


SAG0707


oo4 


transcriptional regulator, RegM family 


ORF00800


SAG0708


4oo 


alpha amylase family protein 


ORF00801 


SAG0709 


332 


glycosyl transferase, group 1 family protein 


ORF00802 


SAG0710 


444 


glycosyl transferase, group 1 family protein 


ORF00803 


SAG0711


647 


threonyl-tRNA synthetase 


ORF00804 


SAG0712


234 


DNA-binding response regulator 


ORF00805 


SAG0713 


339 


conserved hypothetical protein 


ORF00806 


SAG0714 


188 


conserved hypothetical protein 


ORF00807


SAG0715 


216 


amino acid ABC transporter, permease protein 


ORF00808 


SAG0716 


231 


amino acid ABC transporter, permease protein 


ORF00809 


SAG0717 


266 


amino acid ABC transporter, amino acid-binding 
protein 


ORF00810 


SAG0718




amino acid ABC transporter, ATP-binding protein 


ORF00811 


SAG0719 


236 


DNA-binding response regulator 


ORF00812 


SAG0720 


449 


sensory box histidine kinase 


ORF00813 


SAG0721 


269 


metallo-beta-lactamase family protein 


ORF00814 


SAG0722


122 


conserved hypothetical protein 


ORF00815 


SAG0723


236 


ribonuclease III 


ORF00816 


SAG0724 


1179


SMC family protein 


ORF00817 


SAG0725


265 


hydrolase, haloacid dehalogenase-like family 


ORF00818 


SAG0726


274 


hydrolase, haloacid dehalogenase-like family 


ORF00819 


SAG0727 


536 


signal recognition particle-docking protein FtsY 


ORF00820 


SAG0728 


270 


ABC transporter, substrate-binding protein 


ORF00821 


SAG0729 


300 


ABC transporter, permease protein, putative 


ORF00822 


SAG0730 


42 


ABC transporter, ATP-binding protein 


ORF00823 


SAG0731 


347 


bacterial luciferase family protein 


ORF00824 


SAG0732 


720 


transcriptional accessory protein Tex, putative 


ORF00825 


SAG0733 


142 


conserved hypothetical protein 


ORF00826 


SAG0734 


87 


phage shock protein C, putative 


ORF00827 


SAG0735 


44 


hypothetical protein 


ORF00828 


SAG0736 


311 


HPr(Ser) kinase/phosphatase 



ORF00830 


SAG0737 


OC7 


prolipoprotein diacylglyceryl transferase 


ORF00832 


SAG0738 




conserved hypothetical protein 


ORF00833 


SAG0739 


H A O 

14o 


conserved hypothetical protein 


ORF00834 


SAG0740




conserved hypothetical protein


ORF00835 


SAG0741 


O AO 


peptidase, U32 family, putative


ORF00836 


SAG0742 


4^0 


peptidase, U32 family


ORF00837


SAG0743


fyj 


conserved hypothetical protein 


ORF00838 


SAG0744 


265 


membrane protein, putative 


ORF00839 


SAG0745 


446 


Mn2+/Fe2+ transporter, NRAMP family


ORF00840 


SAG0746 


369 


riboflavin biosynthesis protein RibD 


ORF00841 


SAG0747 


208 


riboflavin synthase, alpha subunit 


ORF00842 


SAG0748 


397 


riboflavin biosynthesis protein RibA


ORF00843 


SAG0749 


156 


riboflavin synthase, beta subunit 


ORF00844 


SAG0750 


496 


lysyl-tRNA synthetase 


ORF00845 


SAG0751 


300 


hydrolase, haloacid dehalogenase-like family 


ORF00846 


SAG0752


213 


phosphoglycerate mutase family protein 


ORF00847 


SAG0753


10/ 


ebsC family protein, putative 


ORF00848


SAG0754


OAK 


conserved domain protein


ORF00850


SAG0755




peptidase, U32 family


ORF00852


SAG0756




conserved hypothetical protein


ORF00853


SAG0757


I zy 


lipoprotein, putative


ORF00855


SAG0758


CQQ 

oyy 


oligoendopeptidase F, putative


ORF00856


SAG0759


yo j 


phosphoenolpyruvate carboxylase


ORF00857 


SAG0760 


377 


IS1548, transposase 


ORF00859 


SAG0761 


422 


cell division protein, FtsW/RodA/SpoVE family 


ORF00861 


SAG0762 


398 


translation elongation factor Tu 


ORF00863 


SAG0763 


252 


triosephosphate isomerase 


ORF00865 


SAG0764 


230 


phosphoglycerate mutase 


ORF00866 


SAG0765 


681 


penicillin-binding protein 2b 


ORF00867 


SAG0766 


198 


recombination protein RecR 


ORF00868 


SAG0767 


348 


D-alanine-D-alanine ligase 


ORF00869 


SAG0768 


455 


UDP-N-acetylmuramoylalanyl-D-glutamyl-2,6-
diaminopimelate--D-alanyl-D-alanyl ligase


ORF00870 


SAG0769 


406 


oxalate:formate antiporter 


ORF00871 


SAG0770 


228 


conserved hypothetical protein 


ORF00872 


SAG0771 


512 


cell wall surface anchor family protein 


ORF00873 


SAG0772 


514 


peptide chain release factor 3 


ORF00874 


SAG0773 


126 


conserved hypothetical protein 


ORF00876 


SAG0774 


244 


ABC transporter, ATP-binding protein 


ORF00878 


SAG0775 


220 


ABC transporter, permease protein 


ORF00879 


SAG0776 


276 


lipoprotein, putative


ORF00880 


SAG0777 


528 


ATP-dependent RNA helicase, DEAD/DEAH box 
family 


ORF00882 


SAG0778 


88 


conserved hypothetical protein 


ORF00883 


SAG0779 


254 


conserved hypothetical protein 


ORF00884 


SAG0780 


246 


acyltransferase family protein 


ORF00885 


SAG0781 


217 


competence protein CelA 


ORF00887 


SAG0782 


745 


DNA internalization-related competence protein 
ComEC/Rec2 


ORF00888 


SAG0783 


269 


hydrolase, haloacid dehalogenase-like family 


ORF00889 


SAG0784 


314 


sugar-binding transcriptional regulator, LacI family


ORF00890


SAG0785 


330 


conserved hypothetical protein 


ORF00891 


SAG0786 


242 


conserved domain protein 


ORF00892 


SAG0787 


345 


DNA polymerase III, delta subunit, putative
ORF00893


SAG0788 


202 


superoxide dismutase, Fe-Mn 


ORF00894 


SAG0789 


283 


transcriptional antiterminator LicT 


ORF00895 


SAG0790 


622 


PTS system, beta-glucosides-specific IIABC
components 


ORF00896 


SAG0791 


475 


6-phospho-beta-glucosidase 


ORF00898 


SAG0792 


364 


conserved hypothetical protein 


ORF00899 


SAG0793 


380 


conserved hypothetical protein TIGR00045 


ORF00900 


SAG0794 


418 


permease, GntP family 


ORF00902 


SAG0795 


354 


conserved hypothetical protein 


ORF00903 


SAG0796 


147 


transcriptional regulator, MarR family 


ORF00904 


SAG0797 


342 


S-adenosylmethionine:tRNA ribosyltransferase- 
isomerase 


ORF00905 


SAG0798 


ooci 


membrane protein, putative 


ORF00906 


SAG0799


ooo 
Zoo 


glucosamine-6-phosphate isomerase


ORF00907 


SAG0800


HQ 

olo 


Glutathione S-transferases domain protein 


ORF00908


SAG0801


OOO 

zo9 


ribosomal small subunit pseudouridine synthase 


ORF00909 


SAG0802


oo 
So 


hypothetical protein 


ORF00910


SAG0803


OQO 

Ooo 


major facilitator family protein 


ORF00911


SAG0804


o-i 


competence protein CoiA 


ORF00912


SAG0805


DU1 


oligoendopeptidase F


ORF00913


SAG0806


OAO 


hydrolase, haloacid dehalogenase-like family


ORF00914


SAG0807


zoo 


O-methyltransferase family protein


URrOU91b 


SAG0808


ouy 


protease maturation protein, putative 


UKr0091o 


SAG0809


ID! 


conserved hypothetical protein 


ORF00919


SAG0810 


872 


alanyl-tRNA synthetase 


ORF00921 


SAG0811


238 


membrane protein, putative 


ORF00922 


SAG0812 


272 


glycosyl transferase, family 8 


ORF00923 


SAG0813 


81 


hypothetical protein 


ORF00924 


SAG0814 


95 


conserved domain protein 


ORF00925 


SAG0815 


71 


transcriptional regulator, Cro/CI family 


ORF00926 


SAG0816 


253 


conserved hypothetical protein 


ORF00927 


SAG0817 


187 


conserved hypothetical protein 


ORF00928 


SAG0818 


319 


ribonucleoside-diphosphate reductase 2, beta subunit 


ORF00929 


SAG0819 


719 


ribonucleoside-diphosphate reductase 2, alpha subunit 


ORF00930 


SAG0820 


74 


ribonucleoside-diphosphate reductase 2, NrdH-redoxin 


ORF00931 


SAG0821 


87 


phosphocarrier protein HPr 


ORF00932 


SAG0822 


577 


phosphoenolpyruvate-protein phosphotransferase 


ORF00933 


SAG0823 


475 


glyceraldehyde-3-phosphate dehydrogenase, NADP- 
dependent 


ORF00934 


SAG0824 


417 


polysaccharide deacetylase family protein 


ORF00935 


SAG0825 


360 


ATP-dependent RNA helicase, DEAD/DEAH box
family 


ORF00936 


SAG0826 


209 


uridine kinase 


ORF00937


SAG0827


t UJ 


conserved hypothetical protein


ORF00938 


SAG0828 


554 


DNA polymerase III, gamma and tau subunits 


ORF00939 


SAG0829 


64 


conserved hypothetical protein 


ORF00940 


SAG0830 


311 


biotin--acetyl-CoA-carboxylase ligase


ORF00941 


SAG0831 


398 


S-adenosylmethionine synthetase 


ORF00942 


SAG0832 


753 


hypothetical protein 


ORF00943 


SAG0833 


181 


hypothetical protein 


ORF00944 


SAG0834 


42 


hypothetical protein 


ORF00945 


SAG0835 


188 


conserved hypothetical protein


ORF00946


SAG0836 


184


conserved hypothetical protein 


ORF00948 


SAG0837 


428 


ABC transporter, ATP-binding protein 


ORF00950 


SAG0838 


233 


hypothetical protein 


ORF00951 


SAG0839 


226 


transcriptional regulator, TenA family 


ORF00952 


SAG0840 


265 


phosphomethylpyrimidine kinase 


ORF00953


SAG0841 


256 


hydroxyethylthiazole kinase 


ORF00954


SAG0842 


223 


thiamine-phosphate pyrophosphorylase 


ORF00955 


SAG0843 


419 


UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


ORF00956 


SAG0844 


184 


acetyltransferase, GNAT family


ORF00957 


SAG0845 


427 


CBS domain protein 


ORF00958 


SAG0846 


286 


methionine aminopeptidase, type I 


ORF00959 


SAG0847 


306 


ribonuclease BN, putative 


ORF00961 


SAG0848 


151 


GtrA family protein 


ORF00962 


SAG0849 


169 


conserved hypothetical protein 


ORF00963 


SAG0850 


652 


DNA ligase, NAD-dependent


ORF00964 


SAG0851 


339 


BmrU protein, putative 


ORF00966 


SAG0852 


766 


pullulanase, putative 


ORF00967 


SAG0853 


622 


1,4-alpha-glucan branching enzyme


ORF00968


SAG0854 


379 


glucose-1-phosphate adenylyltransferase


ORF00969 


SAG0855 




glycogen biosynthesis protein GlgD FRAMESHIFT 


ORF00971 


SAG0856 


476 


glycogen synthase 


ORF00972 


SAG0857 


66 


ATP synthase F0, C subunit 


ORF00973 


SAG0858 


238 


ATP synthase F0, A subunit 


ORF00974 


SAG0859 


165 


ATP synthase F0, B subunit 


ORF00975 


SAG0860 


178 


ATP synthase F1, delta subunit 


ORF00976 


SAG0861 


501 


ATP synthase F1, alpha subunit 


ORF00977 


SAG0862 


293 


ATP synthase F1, gamma subunit 


ORF00978 


SAG0863 


468 


ATP synthase F1, beta subunit 


ORF00979 


SAG0864 


137 


ATP synthase F1, epsilon subunit


ORF00980 


SAG0865 


76 


conserved hypothetical protein 


ORF00981 


SAG0866 


423 


UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


ORF00982 


SAG0867 


63 


conserved hypothetical protein 


ORF00983 


SAG0868 


285 


DNA-entry nuclease 


ORF00984 


SAG0869 


346 


phenylalanyl-tRNA synthetase, alpha subunit 


ORF00985 


SAG0870 


173 


acetyltransferase, GNAT family


ORF00986 


SAG0871 


801 


phenylalanyl-tRNA synthetase, beta subunit 


ORF00987 


SAG0872 


300 


conserved hypothetical protein 


ORF00988 


SAG0873 


1077 


exonuclease RexB 


ORF00989 


SAG0874 


1207 


exonuclease RexA 


ORF00990 


SAG0875 


305 


magnesium transporter, CorA family, putative 


ORF00991 


SAG0876 


458 


tRNA modification GTPase TrmE 


ORF00992 


SAG0877 


636 


ABC transporter, ATP-binding protein


ORF00993 


SAG0878


322 


acetoin dehydrogenase, thymine PPi dependent, E1 
component, alpha subunit 


ORF00994 


SAG0879 


332 


acetoin dehydrogenase, thymine PPi dependent, E1 
component, beta subunit 


ORF00995 


SAG0880 


462 


acetoin dehydrogenase, thymine PPi dependent, E2 
component, dihydrolipoamide acetyltransferase 


ORF00996 


SAG0881 


585 


acetoin dehydrogenase, thymine PPi dependent, E3 
component, dihydrolipoamide dehydrogenase 


ORF00997 


SAG0882 


329 


lipoate-protein ligase A


ORF00998 


SAG0883 


261 


cobyric acid synthase, putative


ORF00999


SAG0884 


447 


mur ligase family protein 


ORF01000 


SAG0885 


283 


conserved hypothetical protein TIGR00159


ORF01001 


SAG0886 


319 


Gram-positive signal peptide, YSIRK family domain protein


ORF01002 


SAG0887 


450 


phosphoglucomutase/phosphomannomutase family 
protein 


ORF01003 


SAG0888 


123 


conserved hypothetical protein 


ORF01004 


SAG0889 


126 


conserved hypothetical protein 


ORF01005 


SAG0890 


376 


oxygen-independent coproporphyrinogen III oxidase, 
putative 


ORF01006 


SAG0891 


245 


conserved hypothetical protein 


ORF01007 


SAG0892 


256 


hydrolase, haloacid dehalogenase-like family 


ORF01008 


SAG0893 


218 


conserved hypothetical protein 


ORF01009 


SAG0894 


1370 


conserved hypothetical protein 


ORF01010 


SAG0895 


289 


lipoyl-binding domain protein


ORF01011 


SAG0896 


108 


oxidoreductase, putative 


ORF01012 


SAG0897 


221 


conserved hypothetical protein


ORF01013 


SAG0898 


83 


hypothetical protein 


ORF01014 


SAG0899 


57 


hypothetical protein 


ORF01015 


SAG0900 


56 


hypothetical protein 


ORF01016 


SAG0901 


127 


hypothetical protein 


ORF01018 


SAG0902 


45 


hypothetical protein 


ORF01019 


SAG0903 


44 


hypothetical protein 


ORF01021 


SAG0904 


56 


hypothetical protein 


ORF01022 


SAG0905 


138 


nucleoside diphosphate kinase 


ORF01023 


SAG0906 


610 


GTP-binding protein LepA 


ORF01024 


SAG0907 


877 


streptococcal histidine triad family protein 


ORF01025 


SAG0908 


203 


HD domain protein 


ORF01026 


SAG0909 


154 


acetyltransferase, GNAT family 


ORF01027 


SAG0910 


144 


PilB-related protein 


ORF01030 


SAG0911 


930 


cation-transporting ATPase, E1-E2 family 


ORF01031 


SAG0912 


367 


nucleoside diphosphate kinase domain protein 


ORF01032 


SAG0913 


212 


chloramphenicol acetyltransferase 


ORF01033 


SAG0914 


203


conserved hypothetical protein 


ORF01034 


SAG0915 


405


Tn916, transposase


ORF01035 


SAG0916 


67 


Tn916, excisionase 


ORF01037 


SAG0918 


76 


Tn916, hypothetical protein 


ORF01038 


SAG0919 


157 


Tn916, hypothetical protein 


ORF01039 


SAG0921 


117 


Tn916, transcriptional regulator, putative 


ORF01040 


SAG0923 


639 


Tn916, tetracycline resistance protein 


ORF01041 


SAG0925 


310 


Tn916, hypothetical protein 


ORF01042


SAG0926 


333 


Tn916, NLP/P60 family protein 


ORF01044 


SAG0927 


725 


Tn916, hypothetical protein FRAMESHIFT 


ORF01047 


SAG0928 




Tn916, hypothetical protein FRAMESHIFT 


ORF01048 


SAG0929 


168 


Tn916, hypothetical protein 


ORF01049 


SAG0930 


165 


Tn916, hypothetical protein 


ORF01050 


SAG0931 


73 


Tn916, hypothetical protein 


ORF01051 


SAG0932 


401 


Tn916, transcriptional regulator, putative 


ORF01052 


SAG0933 


461 


Tn916, FtsK/SpoIIIE family protein


ORF01053 


SAG0934 


128 


Tn916, hypothetical protein 


ORF01054 


SAG0935 


104 


Tn916, hypothetical protein


ORF01056 


SAG0937 




ABC transporter, ATP-binding protein, FRAMESHIFT 


ORF01057 


SAG0938 


122 


transcriptional regulator, GntR family 


ORF01058 


SAG0939 


1034 


DNA polymerase III, alpha subunit


ORF Ref No.


SAGxxxx Ref No.


aa


Annotation


ORF01059


SAG0940




6-phosphofructokinase


ORF01060


SAG0941


ouu 


pyruvate kinase


ORF01061 


SAG0942 


185 


signal peptidase I, putative


ORF01062


SAG0943


47 


hypothetical protein


ORF01063 


SAG0944 


604 


glucosamine--fructose-6-phosphate aminotransferase (isomerizing)


ORF01064


SAG0945


Oil 


IS1548, transposase


ORF01066


SAG0946


A HQ 

i uy 


phnA protein


UKrUlUoo 




OA L 
Z.IO 


amino acid ABC transporter, permease protein


ORF01069


SAG0948


zuy 


amino acid ABC transporter, ATP-binding protein


ORF01070 


SAG0949 


276 


amino acid ABC transporter, amino acid-binding 
protein 


ORF01072 


SAG0950 


82 


ribosomal protein S20 


ORF01073 


SAG0951 


306 


pantothenate kinase 


ORF01074 


SAG0952 


196 


conserved hypothetical protein 


ORF01075 


SAG0953 


129 


cytidine deaminase 


ORF01076 


SAG0954 


349 


lipoprotein 


ORF01077 


SAG0955 


511 


sugar ABC transporter, ATP-binding protein 


ORF01078 


SAG0956 


353 


sugar ABC transporter, permease protein, putative


ORF01079


SAG0957


O I o 


sugar ABC transporter, permease protein, putative


ORF01080


SAG0958 


456 


NADH oxidase 






329 


L-lactate dehydrogenase


ORF01082




819 


DNA gyrase, A subunit 


ORF01083


SAG0961 


247 


sortase SrtA 


ORF01084




137 


glyoxylase family protein 


ORF01085


SAG0963


320 


conserved hypothetical protein 


ORF01086


SAG0964 


375 


Na+/H+ exchanger family protein 


ORF01087 


SAG0965 


127 


IS1381, transposase OrfA 


ORF01088


SAG0966 


129 


IS1381, transposase OrfB


ORF01089 


SAG0967 


520 


GMP synthase 


ORF01090 



SAG0968


232 


transcriptional regulator, GntR family 


ORF01091 


SAG0969 


444 


gid protein 


ORF01092 


SAG0970 


247 


acetyltransferase, GNAT family 


ORF01093


SAG0971


282 


lipoprotein, putative


ORF01095 


SAG0972 




conserved hypothetical protein, FRAMESHIFT 


ORF01096 


SAG0973 


OziU 


nisin-resistance protein, putative


ORF01097


SAG0974 


ocn 


ABC transporter, ATP-binding protein


ORF01098 


SAG0975 


ool 


ABC transporter, permease protein, putative


ORF01099 


SAG0976 


222 


DNA-binding response regulator


ORF01100 


SAG0977 


312 


sensor histidine kinase 


ORF01101 


SAG0978 


ODD 


site-specific recombinase, phage integrase family


ORF01102 


SAG0979 


ceo 

boo 


ABC transporter, substrate-binding protein, putative


ORF01103




257 


conserved hypothetical protein 


ORF01104 


SAG0981 


228 


SatD 


ORF01106 


SAG0982 


521 


signal recognition particle protein 


ORF01108 


SAG0983 


110 


conserved hypothetical protein 


ORF01109 


SAG0984 


437 


sensor histidine kinase CiaH 


ORF01110 


SAG0985 


226 


DNA-binding response regulator CiaR 


ORF01111 


SAG0986 


849 


aminopeptidase N 


ORF01112 


SAG0987 


217 


phosphate transport system regulatory protein PhoU 






WO 2004/018646 



PCT/US2003/026827 



Table 32: Conversion of ORF Ref Nos. with SAG Ref Nos. 



ORF Ref No. 


SAGxxxx Ref No. 


aa 


Annotation 


ORF01113 


SAG0988 


252 


phosphate ABC transporter, ATP-binding protein PstB, 
putative 


ORF01114 


SAG0989 


267 


phosphate ABC transporter, ATP-binding protein PstB, 
putative 


ORF01115 


SAG0990 


295 


phosphate ABC transporter, permease protein PstA, 
putative 


ORF01116 


SAG0991 


305 


phosphate ABC transporter, permease protein 


ORF01117 


SAG0992 


286 


phosphate ABC transporter, phosphate-binding protein 


ORF01118 


SAG0993 


436 


NOL1/NOP2/sun family protein 


ORF01119 


SAG0994 


254 


inositol monophosphatase family protein 


ORF01120 


SAG0995 


93 


conserved hypothetical protein 


ORF01121 


SAG0996 


137 


conserved hypothetical protein 


ORF01122 


SAG0997 


310 


macrolide-efflux protein mreA/riboflavin biosynthesis
protein RibF 


ORF01123 


SAG0998 


294 


tRNA pseudouridine synthase B 


ORF01124 


SAG0999 


143 


acetyltransferase, GNAT family 


ORF01125 


SAG1000 


423 


conserved hypothetical protein 


ORF01126 


SAG1001 


196 


conserved hypothetical protein 


ORF01127 


SAG1002 


292 


protease, putative 


ORF01128 


SAG1003 


876 


permease, putative 


ORF01129 


SAG1004 


233 


ABC transporter, ATP-binding protein 


ORF01131 


SAG1005


706 


DNA topoisomerase I 


ORF01132 


SAG1006 


280 


DprA/SMF protein, putative DNA processing factor 


ORF01133 


SAG1007 


342 


iron-compound ABC transporter, iron-compound-binding protein


ORF01134 


SAG1008 


253 


iron compound ABC transporter, ATP-binding protein 


ORF01135 


SAG1009 


324 


iron compound ABC transporter, permease protein 


ORF01136 


SAG1010 


320 


iron compound ABC transporter, permease protein 


ORF01137 



SAG1011 


182 


acetyltransferase, CysE/LacA/LpxA/NodL family 


ORF01138


SAG1012 


253 


ribonuclease HII


ORF01139


SAG1013


283 


GTP-binding protein


ORF01140


SAG1014


190 


conserved hypothetical protein 


ORF01142


SAG1015 


494 


carbon starvation protein CstA, putative 


ORF01143 


SAG1016 


244 


response regulator 



ORF01144 


SAG1017 


579 


sensor histidine kinase, putative 



ORF01145 


SAG1018 


40 


hypothetical protein 


ORF01146


SAG1019 


39 


conserved hypothetical protein, FRAMESHIFT


ORF01148 


SAG1020


227 


hypothetical protein 


ORF01149 


SAG1021


107 


hypothetical protein 


ORF01150


SAG1022


177 


hypothetical protein 


ORF01151 


SAG1023 


48 


hypothetical protein 


ORF01152 


SAG1024 


183 


hypothetical protein 


ORF01153 


SAG1025 


149 


hypothetical protein 


ORF01156 


SAG1026




immunogenic secreted protein, DEGENERATE 


ORF01157 


SAG1027 


84 


conserved hypothetical protein 


ORF01158 


SAG1028 


196 


hypothetical protein 


ORF01159 


SAG1029 


101 


hypothetical protein 


ORF01160 


SAG1030 


304 


conserved hypothetical protein 


ORF01161 


SAG1031 


120 


extracellular protein, putative POINT MUTATION


ORF01162 


SAG1032 


85 


conserved hypothetical protein 


ORF01164 


SAG1033 


1309 


FtsK/SpoIIIE family protein


ORF01166 


SAG1034 


55 


hypothetical protein 





ORF01167 


SAG1035 


424 


conserved hypothetical protein 


ORF01168 


SAG1036


80 


conserved hypothetical protein 


ORF01169


SAG1037


157 


hypothetical protein 


ORF01172


SAG1038


1003 


phage infection protein, putative 


ORF01173


SAG1039


96 


conserved hypothetical protein 


ORF01174


SAG1040


260 


conserved domain protein 


ORF01175 


SAG1041


107 


hypothetical protein 


ORF01176 


SAG1042


1060 


carbamoyl-phosphate synthase, large subunit 


ORF01177 


SAG1043


358 


carbamoyl-phosphate synthase, small subunit 


ORF01178 


SAG1044


307 


aspartate carbamoyltransferase 


ORF01179 


SAG1045


430 


dihydroorotase, multifunctional complex type 


ORF01180 


SAG1046


209 


orotate phosphoribosyltransferase 


ORF01181 


SAG1047


233 


orotidine 5'-phosphate decarboxylase 


ORF01182


SAG1048


410 


membrane protein, putative 


ORF01183 


SAG1049


513 


ABC transporter, ATP-binding protein 


ORF01184


SAG1050


112


ribonucleotide reductase, truncation 


ORF01185


SAG1051


358 


aspartate-semialdehyde dehydrogenase 


ORF01186


SAG1052


4/ 


cell wall surface anchor family protein, putative 


ORF01187 


SAG1053


oO 


hypothetical protein 


ORF01188


SAG1054


5o1 


cardiolipin synthetase 


ORF01189 


SAG1055


55b 


formate--tetrahydrofolate ligase


ORF01190


SAG1056


000 

ooy 


lipoate-protein ligase A 


ORF01191


SAG1057




conserved hypothetical protein 


ORF01192 


SAG1058


2/2 


conserved hypothetical protein 


ORF01193 


SAG1059


110 


glycine cleavage system H protein, putative 


ORF01194 


SAG1060


000 
o2o 


bacterial luciferase family protein 


ORF01195 


SAG1061


399 


oxidoreductase, FMN-binding


ORF01197 


SAG1062


282 


lipoate-protein ligase A family protein 


ORF01198 


SAG1063


000 
22o 


flavoprotein-related protein


ORF01199 


SAG1064


180 


flavoprotein family protein 


ORF01200 


SAG1065


190 


membrane protein, putative 


ORF01201


SAG1066


572 


phosphoglucomutase 


ORF01202 


SAG1067


A ~7Q 

1 fo 


IS861, transposase OrfA


ORF01203 


SAG1068


277 


IS861, transposase OrfB


ORF01204 


SAG1069


AC 

bo 


hypothetical protein 


ORF01205 



SAG1070 


577 


ABC transporter, ATP-binding/permease protein 


ORF01206 


SAG1071 


573 


ABC transporter, ATP-binding/permease protein 



ORF01207 


SAG1072


200 


conserved hypothetical protein 


ORF01208 


SAG1073


00c 
o2o 


conserved hypothetical protein 


ORF01209 


SAG1074


>1 <4 O 

41o 


serine hydroxymethyltransferase


ORF01210 


SAG1075


H OO 

too 


Sua5/YciO/YrdC/YwlC family protein


ORF01211


SAG1076


2/ O 


modification methylase, HemK family


ORF01212


SAG1077


OCA 

ooy 


peptide chain release factor 1 


ORF01213


SAG1078


loy 


thymidine kinase


ORF01214 


SAG1079 


60 


4-oxalocrotonate tautomerase 


ORF01215 


SAG1080 


47 


hypothetical protein 


ORF01216 


SAG1081


312 


ApbE family protein 


ORF01217 


SAG1082 


200 


conserved hypothetical protein 


ORF01218 


SAG1083 


411 


conserved hypothetical protein 


ORF01219 


SAG1084 


262 


formate/nitrite transporter family protein 


ORF01220


SAG1085 


424 


xanthine permease 


ORF01221 


SAG1086 


193 


xanthine phosphoribosyltransferase 


ORF01222


SAG1087 


327 


guanosine monophosphate reductase


ORF01223 


SAG1088 


446 


drug resistance transporter, EmrB/QacA family, 
putative


ORF01224


SAG1089


230 


conserved hypothetical protein 


ORF01225 


SAG1090 


666 


potassium uptake protein, putative 


ORF01226 


SAG1091 


216 


oxidoreductase, short chain dehydrogenase/reductase 
family, FRAMESHIFT 


ORF01227 


SAG1092


330 


phosphate acetyltransferase 


ORF01228 


SAG1093


294 


ribosomal large subunit pseudouridine synthase, RluD 
subfamily 


ORF01229


SAG1094


97ft 
Zf O 


conserved hypothetical protein


ORF01230




99*3 
ZZo 


GTP pyrophosphokinase family protein


ORF01231


SAG1096


a. on 
»yu 


conserved hypothetical protein 


ORF01232


SAG1097


oz4 


ribose-phosphate pyrophosphokinase 


ORF01233


SAG1098


Of 1 


cysteine desulphurase 


ORF01234


SAG1099


no 


conserved hypothetical protein 


ORF01235 


SAG1100 


210 


DNA-binding protein 


ORF01236 


SAG1101 


226 


DNA repair protein RadC 


ORF01237


SAG1102 


377 


membrane protein, putative 


ORF01238 


SAG1103 


478 


6-phospho-beta-glucosidase 


ORF01239 


SAG1104 


204 


platelet activating factor, putative 


ORF01240 


SAG1105 


273 


hydrolase, haloacid dehalogenase-like family 


ORF01241 


SAG1106 


309 


transcriptional regulator, AraC family, putative 


ORF01242 


SAG1107 


510 


voltage-gated chloride channel family protein 


ORF01243


SAG1108 


357 


spermidine/putrescine ABC transporter, 
spermidine/putrescine-binding protein 


ORF01244 


SAG1109 


258 


spermidine/putrescine ABC transporter, permease 
protein 


ORF01245 


SAG1110 


264 


spermidine/putrescine ABC transporter, permease 
protein 


ORF01246 


SAG1111 


384 


spermidine/putrescine ABC transporter, ATP-binding 
protein 


ORF01247 


SAG1112 


300 


UDP-N-acetylenolpyruvoylglucosamine reductase


ORF01248 


SAG1113 


162 


2-amino-4-hydroxy-6-hydroxymethyldihydropteridine 
pyrophosphokinase 


ORF01249


SAG1114


A 9H 
IZU 


dihydroneopterin aldolase


ORF01250


SAG1115


9f57 
ZO A 


dihydropteroate synthase 


ORF01251


SAG1116


1 ft7 
I Of 


GTP cyclohydrolase I


ORF01252


SAG1117


4ZU 


folylpolyglutamate synthase


ORF01253


SAG1118


one • 

zyo 


rarD protein 


ORF01254




9ftft 
Zoo 


homoserine kinase


ORF01255


SAG1120


497 
*fZ / 


homoserine dehydrogenase 


ORF01256


SAG1121


zyo 


polysaccharide deacetylase family protein 


ORF01257


SAG1122


O I o 


iransponer, dl»Lt i Tamiiy protein 


ORF01258


SAG1123




hypothetical protein


ORF01259


SAG1124


/I f^ft 

4oo 


aldehyde dehydrogenase family protein 


ORF01260 


SAG1125


335 


membrane protein 


ORF01261


SAG1126


99ft 
ZZO 


conserved hypothetical protein 


ORF01262 


SAG1127 


113 


conserved hypothetical protein, FRAMESHIFT 


ORF01263 




187 


hypothetical protein 


ORF01264 


SAG1128 


65 


transcriptional regulator, Cro/CI family 


ORF01265 


SAG1129 


36 


hypothetical protein 


ORF01266


SAG1130 


49 


hypothetical protein 


ORF01268 


SAG1131 


164 


thiol peroxidase 


ORF01269 


SAG1132


219 


conserved hypothetical protein 


ORF01272 


SAG1133 


254 


conserved hypothetical protein


ORF01273


SAG1134


213 


transcriptional regulator, GntR family/potassium
uptake protein, TrkA family 


ORF01274 


SAG1135


183 


gls24 protein, putative 


ORF01275 


SAG1136




conserved hypothetical protein FRAMESHIFT 


ORF01276 


SAG1137


180 


gls24 protein, putative 


ORF01277 


SAG1138


64 


conserved hypothetical protein 


ORF01279 


SAG1139


193 


conserved hypothetical protein 


ORF01280 


SAG1140 


82 


conserved hypothetical protein 


ORF01281 


SAG1141 


112 


conserved hypothetical protein 


ORF01282 


SAG1142 


759 


ATP-dependent DNA helicase PcrA 


ORF01283 


SAG1143


100 


conserved hypothetical protein, FRAMESHIFT 


ORF01284


SAG1144


441 


uracil permease 


ORF01285 


SAG1145


448 


sodium:alanine symporter family protein 


ORF01286 


SAG1146


411 


cation efflux family protein 


ORF01287 


SAG1147


130 


conserved hypothetical protein 


ORF01288 


SAG1148 


231 


membrane protein, putative 


ORF01289


SAG1149 


207 


conserved hypothetical protein 


ORF01290 


SAG1150 


400 


ribosomal protein S1 


ORF01291 


SAG1151 


76


conserved hypothetical protein 


ORF01292 


SAG1152 


340 


branched-chain amino acid aminotransferase 


ORF01294 


SAG1153 


819 


DNA topoisomerase IV, A subunit 


ORF01295 


SAG1154


653 


DNA topoisomerase IV, B subunit 


ORF01296 


SAG1155 


207 


conserved hypothetical protein TIGR00023 


ORF01297 


SAG1156


217 


uracil-DNA glycosylase 


ORF01298 


SAG1157 


161 


conserved hypothetical protein 


ORF01299


SAG1158 


413 


CMP-N-acetylneuraminic acid synthetase NeuA 


ORF01300 


SAG1159


209 


neuD protein 


ORF01301 


SAG1160 


384 


UDP-N-acetylglucosamine-2-epimerase NeuC 


ORF01302 


SAG1161 


341 


N-acetyl neuramic acid synthetase NeuB 


ORF01303 


SAG1162 


466 


cpsL protein 


ORF01304 


SAG1163 


318 


cpsVK protein 


ORF01305 


SAG1164 


321 


cpsVJ protein 


ORF01306 


SAG1165 


327 


cpsVO protein 


ORF01307 


SAG1166 


295 


cpsVN protein 


ORF01308 


SAG1167 


241 


cpsVM protein 


ORF01309 


SAG1168 


364 


cpsVH protein 


ORF01310 


SAG1169


163 


CpsVG 


ORF01311 


SAG1170 


149 


CpsF 


ORF01312 


SAG1171 


462 


CpsE 


ORF01313 


SAG1172 


229 


CpsD protein 


ORF01314 


SAG1173 


230 


cpsC protein 


ORF01315 


SAG1174 


243 


capsular polysaccharide biosynthesis protein CpsB 


ORF01316 


SAG1175 


485 


capsular polysaccharide biosynthesis protein CpsA 


ORF01317 


SAG1176 


290 


capsular polysaccharide synthesis operon 
transcriptional regulator CpsY 


ORF01318 


SAG1177 


255 


cpslaS protein 


ORF01319 


SAG1178 


236 


purine nucleoside phosphorylase 


ORF01320 


SAG1179 


418 


voltage-gated chloride channel family protein, putative 


ORF01321 


SAG1180 


269 


purine nucleoside phosphorylase 


ORF01322 


SAG1181 


135 


arsenate reductase 


ORF01323


SAG1182


403 


phosphopentomutase


ORF01324 


SAG1183 


223 


ribose 5-phosphate isomerase


ORF01326


SAG1184


236 


conserved hypothetical protein 


ORF01327 


SAG1185 


262 


tributyrin esterase 


ORF01328 


SAG1186 


553 


metallo-beta-lactamase superfamily protein


ORF01329 


SAG1187 


253 


ABC transporter, ATP-binding protein 


ORF01330 


SAG1188 


287 


ABC transporter, permease protein 


ORF01331


SAG1189 


334 


conserved hypothetical protein 


ORF01332


SAG1190 


551 


adherence and virulence protein A 


ORF01333 


SAG1191 


239 


alpha-acetolactate decarboxylase 


ORF01334 


SAG1192 


560 


acetolactate synthase, catabolic 


ORF01335 


SAG1193 


408 


TPR domain protein 


ORF01336 


SAG1194 


396 


membrane protein 


ORF01337 


SAG1195 


153 


MutT/nudix family protein 


ORF01338 


SAG1196 


160 


mutator MutT protein 


ORF01339 


SAG1197


1072 


hyaluronidase


ORF01340 


SAG1198


348 


dTDP-glucose 4,6-dehydratase 


ORF01341


SAG1199 


197 


dTDP-4-dehydrorhamnose 3,5-epimerase


ORF01342 


SAG1200


289 


glucose-1-phosphate thymidylyltransferase


ORF01343 


SAG1201


367 


iminodiacetate oxidase, putative 


ORF01344 


SAG1202 


262 


conserved hypothetical protein TIGR00486 


ORF01345 


SAG1203 


227 


conserved hypothetical protein 


ORF01346 


SAG1204 


226 


DNA replication protein DnaD, putative


ORF01347 


SAG1205


172 


adenine phosphoribosyltransferase 


ORF01348 


SAG1206


854 


conserved domain protein 


ORF01349 


SAG1207


32 


hypothetical protein 


ORF01350 


SAG1208


732 


single-stranded-DNA-specific exonuclease RecJ 


ORF01351 


SAG1209 


253 


oxidoreductase, short chain dehydrogenase/reductase 
family 


ORF01352 


SAG1210


309 


metallo-beta-lactamase superfamily protein 


ORF01353 


SAG1211


215 


conserved hypothetical protein 


ORF01354


SAG1212 


412 


GTP-binding protein HflX


ORF01355 


SAG1213


296 


tRNA delta(2)-isopentenylpyrophosphate transferase 


ORF01356 


SAG1214 


58 


hypothetical protein 


ORF01357 


SAG1215 


305 


exfoliative toxin A, putative 


ORF01358 


SAG1216 


1252 


pullulanase, putative 


ORF01361 


SAG1217




conserved hypothetical protein, FRAMESHIFT 


ORF01362 


SAG1218


194 


conserved hypothetical protein 


ORF01363 


SAG1219


468 


peptidase, M20/M25/M40 family 


ORF01364 


SAG1220


200 


nitroreductase family protein 


ORF01365 


SAG1221 




glycerophosphoryl diester phosphodiesterase, 
putative, POINT MUTATION 


ORF01367 


SAG1222


593 


excinuclease ABC, C subunit 


ORF01368 


SAG1223 


255 


conserved hypothetical protein 


ORF01369 


SAG1224 


446 


MATE efflux family protein 


ORF01370 


SAG1225 


136 


conserved hypothetical protein 


ORF01371 


SAG1226 


165 


conserved hypothetical protein 


ORF01372 


SAG1227 


198 


conserved hypothetical protein 


ORF01373 


SAG1228


96 


ISSdy1, transposase OrfA


ORF01374 


SAG1229 


259 


ISSdy1, transposase OrfB


ORF01375 


SAG1230 


96 


conserved hypothetical protein 


ORF01377 


SAG1231 




transposase OrfB, IS3 family, degenerate 
FRAMESHIFT 


ORF01379 


SAG1232 


77 


transposase OrfB, IS3 family, truncation 


ORF01380 


SAG1233


822 


streptococcal histidine triad family protein 


ORF01381 


SAG1234 


306 


laminin-binding surface protein


ORF01382


SAG1235


A OK 
4^0 


GBSi1, group II intron, maturase


ORF01383


SAG1236




c5a peptidase precursor FRAMESHIFT 


ORF01384


SAG1237


444 


hypothetical protein 


ORF01385


SAG1238


ono 


hypothetical protein


ORF01386


SAG1239


7C 
/O 


conserved hypothetical protein 


ORF01387


SAG1240


12o 


conserved hypothetical protein, truncation 


ORF01388


SAG1241


70 


transposase OrfA, IS3 family


ORF01389


SAG1242


67 


transposase OrfB, IS3 family, truncation 


ORF01390


SAG1243


96 


ISSdy1, transposase OrfA FRAMESHIFT


ORF01391


SAG1244


259 


ISSdy1, transposase OrfB


ORF01392 


SAG1245


38 


hypothetical protein 


ORF01393 


SAG1246


389 


hypothetical protein 


ORF01394 


SAG1247 


399 


integrase, phage family 


ORF01395 


SAG1248 


75 


conserved hypothetical protein 


ORF01396 


SAG1249 


74 


transcriptional regulator, Cro/CI family 


ORF01397 


SAG1250 


621 


Tn5252, relaxase 


ORF01398 


SAG1251 


121 


Tn5252, Orf 9 protein 


ORF01399 


SAG1252


120 


Tn5252, Orf 10 protein 


ORF01401


SAG1253


435 


transposase, ISL3 family 


ORF01403 


SAG1254


546 


mercuric reductase 


ORF01404 


SAG1255


130 


mercuric resistance operon regulatory protein MerR 


ORF01406


SAG1256


142


IS861, transposase OrfB, truncation 


ORF01407 


SAG1257


709 


cation-transporting ATPase, E1-E2 family 


ORF01408


SAG1258


122 


cadmium efflux system accessory protein 


ORF01409


SAG1259


99 


conserved hypothetical protein 


ORF01410


SAG1260


262 


hypothetical protein 


ORF01411


SAG1261


iyo 


conserved hypothetical protein 


ORF01412


SAG1262


oyo 


cation-transporting ATPase, E1-E2 family 


ORF01414


SAG1263




conserved domain protein, FRAMESHIFT


ORF01415


SAG1264


4 A Q 

14o 


transcriptional repressor CopY, putative 


ORF01416


SAG1265




cadmium resistance transporter, putative 


ORF01417


SAG1266


4 CO 


hypothetical protein 


ORF01418


SAG1267


■i n 0 
TUo 


hypothetical protein 


ORF01419


SAG1268


OO A 

^oU 


repressor protein, putative 


ORF01420


SAG1269


44 


hypothetical protein 


ORF01421


SAG1270


A~7 A 

4/1 


ImpB/MucB/SamB family protein 


UKrU14^o 


SAG1271


-i -1 c 

lib 


conserved hypothetical protein 


ORF01424


SAG1272


1 uz 


conserved hypothetical protein 


ORF01425


SAG1273


Ho 


conserved hypothetical protein 


ORF01426


SAG1274


a on 
1z9 


conserved hypothetical protein 


UKFU142A 


SAG1275


7C 

ID 


hypothetical protein 


UKrU14zo 


SAG1276


OOO 


conserved hypothetical protein 


ORF01430 


SAG1277


163 


hypothetical protein


ORF01431 


SAG1278


96 


hypothetical protein 


ORF01432 


SAG1279


99 


conserved domain protein 


ORF01433 


SAG1280


2274 


Helicases conserved C-terminal domain protein 


ORF01434 


SAG1281 


183 


hypothetical protein 


ORF01435 


SAG1282 


63 


lipoprotein, putative 


ORF01436 


SAG1283 


1631 


cell wall surface anchor family protein


ORF01437 


SAG1284 


196 


abortive infection protein AbiGI 


ORF01438 


SAG1285 


281 


abortive infection protein AbiGII


ORF01439 


SAG1286


933 


conserved hypothetical protein 


ORF01440 


SAG1287 


776 


conserved hypothetical protein 


ORF01441 


SAG1288 


117 


conserved hypothetical protein, DEGENERATE


ORF01442


SAG1289 


284 


conserved hypothetical protein 


ORF01443 


SAG1290


80 


hypothetical protein 


ORF01444 


SAG1291


605 


Tn5252, Orf 21 protein, internal deletion 


ORF01445 


SAG1292 


162 


hypothetical protein 


ORF01446


SAG1293


194 


protease, putative 


ORF01447 


SAG1294


77 


conserved hypothetical protein 


ORF01449


SAG1295


127 


conserved hypothetical protein 


ORF01450 


SAG1296 


142 


conserved hypothetical protein 


ORF01451 


SAG1297 


451 


type II DNA modification methyltransferase Spn5252IP


ORF01452 


SAG1298 


31 


hypothetical protein 


ORF01453 


SAG1299 


272 


conserved hypothetical protein 


ORF01454 


SAG1300 


57 


conserved hypothetical protein 


ORF01455 


SAG1301 


121 


ribosomal protein L7/L12 


ORF01456 


SAG1302 


166 


ribosomal protein L10 


ORF01458 


SAG1303 


702 


ATP-dependent Clp protease, ATP-binding subunit 


ORF01459 


SAG1304


32 


hypothetical protein 


ORF01460 


SAG1305 


314 


homocysteine S-methyltransferase MmuM, putative 


ORF01461 


SAG1306 


458 


amino acid permease 


ORF01463 


SAG1307


216 


hypothetical protein 


ORF01464 


SAG1308 


167 


hypothetical protein 


ORF01465 


SAG1309 


30 


hypothetical protein 


ORF01466 


SAG1310 


182 


transcriptional regulator, TetR family 


ORF01467 


SAG1311 


198 


GTP-binding protein 


ORF01468 


SAG1312 


408 


ATP-dependent Clp protease, ATP-binding subunit ClpX


ORF01469


SAG1313


00 


conserved hypothetical protein


ORF01470 


SAG1314




dihydrofolate reductase


ORF01471 


SAG1315


07Q 


thymidylate synthase


ORF01472 




oyu 




ORF01473 


SAG1317


4Z/ 


3-hydroxy-3-methylglutaryl-CoA reductase


ORF01474 


SAG1318




conserved hypothetical protein


ORF01475 


SAG1319




hemolysin III, putative


ORF01476 




oU4 


conserveo nypuuitsuucii pjuitJin i ivjrsvvj i*t/ 


ORF01477 


SAG1321


Zo4 


glutathione S-transferase family protein


ORF01478 


SAG1322


VO 
f dC 


conserved domain protein


ORF01479 


SAG1323


00 -1 
00 1 


isopentenyl-diphosphate delta-isomerase


ORF01480 


SAG1324


OQA 

00U 


phosphomevalonate kinase


ORF01481 


SAG1325


OH A 
0 14 


diphosphomevalonate decarboxylase


ORF01482 




zyz 


mevalonate kinase, putative


ORF01483 


SAG1327


4uy 


sensor histidine kinase


ORF01484 


SAG1328


OOO 


DNA-binding response regulator


ORF01485 


SAG1329 


208 


GTP pyrophosphokinase family protein 


ORF01486 


SAG1330 


68 


hypothetical protein 


ORF01488 


SAG1331


y / y 


R5 protein


ORF01489 


SAG1332 


146 


transcriptional regulator, MarR family, putative 


ORF01490 


SAG1333 


690 


5-nucleotidase family protein 


ORF01491 


SAG1334 


136 


polypeptide deformylase, putative 


ORF01492 


SAG1335 


449 


NADP-specific glutamate dehydrogenase 


ORF01494 


SAG1336 


169 


conserved hypothetical protein 


ORF01495 


SAG1337 


589 


ABC transporter, ATP-binding/permease protein 


ORF01496 


SAG1338 


579 


ABC transporter, ATP-binding/permease protein 


ORF01497 


SAG1339 


157 


acetyltransferase, GNAT family






WO 2004/018646 



PCT/US2003/026827 



Table 32: Conversion of ORF Ref Nos. with SAG Ref Nos. 



ORF Ref No. 


SAGxxxx Ref No. 


aa


Annotation 


ORF01498 


SAG1340 


622 


ABC transporter, ATP-binding protein 


ORF01499 


SAG1341 


402 


polyA polymerase family protein 


ORF01500 


SAG1342 


282 


DegV family protein 


ORF01501 


SAG1343


126 


conserved hypothetical protein


ORF01502 


SAG1344 


177 


hypothetical protein 


ORF01503 


SAG1345 


164 


conserved hypothetical protein 


ORF01504 


SAG1346 


641 


PTS system, fructose specific IIABC components 


ORF01505 


SAG1347


303 


1-phosphofructokinase


ORF01506 


SAG1348 


247 


lactose phosphotransferase system repressor 


ORF01507 


SAG1349 


411 


beta-lactam resistance factor


ORF01508


SAG1350 


544 


surface antigen-related protein 


ORF01509 


SAG1351 


307 


2-dehydropantoate 2-reductase, putative


ORF01510 


SAG1352 


356 


regulatory protein, putative 


ORF01511 


SAG1353 


330 


pyridine nucleotide-disulphide oxidoreductase family protein


ORF01512 


SAG1354


251 


tRNA (guanine-N1)-methyltransferase


ORF01513 


SAG1355 


172 


16S rRNA processing protein RimM 


ORF01515 


SAG1356 


503 


transcriptional regulator, RofA family 


ORF01516 


SAG1357 


80 


KH domain protein 


ORF01517 


SAG1358 


90 


ribosomal protein S16 


ORF01518 


SAG1359 


415 


permease, putative 


ORF01519 


SAG1360 


236 


ABC transporter, ATP-binding protein 


ORF01520 


SAG1361 


414 


conserved hypothetical protein 


ORF01522 


SAG1362 


532 


carbamoyl-phosphate synthase, large subunit, putative 


ORF01523 


SAG1363 


356 


carbamoyl-phosphate synthase, small subunit 


ORF01524 


SAG1364 


173 


pyrimidine operon regulatory protein 


ORF01525 


SAG1365 


296 


ribosomal large subunit pseudouridine synthase, RluD subfamily


ORF01526 


SAG1366 


154 


lipoprotein signal peptidase 


ORF01527 


SAG1367 


301 


transcriptional regulator, LysR family 


ORF01528 


SAG1368 


94 


ribosomal protein L27 


ORF01529 


SAG1369 


112 


conserved hypothetical protein 


ORF01530 


SAG1370 


104 


ribosomal protein L21 


ORF01531 


SAG1371 


392 


conserved hypothetical protein 


ORF01532 


SAG1372


404 


thiamine biosynthesis protein ThiI


ORF01533 


SAG1373


381 


cysteine desulphurase 


ORF01535 


SAG1374


150 


conserved hypothetical protein 


ORF01536 


SAG1375 


449 


glutathione reductase 


ORF01537 


SAG1376


111 


conserved hypothetical protein


ORF01538 


SAG1377


ooo 
OOO 


chorismate synthase 


ORF01539 


SAG1378


OOO 


3-dehydroquinate synthase


ORF01540 


SAG1379




3-dehydroquinate dehydratase


ORF01541 


SAG1380


385 


conserved hypothetical protein 


ORF01542 


SAG1381 


714 


sulfatase 


ORF01543 




1 1 C7 


ribosomal protein L20


ORF01544 


SAG1383 


66 


ribosomal protein L35


ORF01545 


SAG1384 


176 


translation initiation factor IF-3 


ORF01546 


SAG1385 


227 


cytidylate kinase 


ORF01547 


SAG1386 


174 


conserved hypothetical protein 


ORF01548 


SAG1387 


65 


ferredoxin, 4Fe-4S 


ORF01549 


SAG1388 


163 


conserved hypothetical protein 


ORF01550 


SAG1389 


406 


peptidase T


ORF01551 


SAG1390 


544 


polysaccharide biosynthesis protein, putative


ORF01552


SAG1391 


484 


UDP-N-acetylmuramoylalanyl-D-glutamate--2,6-diaminopimelate ligase


ORF01553 


SAG1392


264 


iron compound ABC transporter, ATP-binding protein 


ORF01554 


SAG1393 


310 


iron compound ABC transporter, substrate-binding protein


ORF01555 


SAG1394 


341 


iron compound ABC transporter, permease protein 


ORF01556 


SAG1395 


333 


iron compound ABC transporter, permease protein 


ORF01557 


SAG1396 


217 


conserved hypothetical protein 


ORF01558 


SAG1397 


311 


inorganic pyrophosphatase, manganese-dependent 


ORF01559 


SAG1398 


262 


pyruvate formate-lyase-activating enzyme 


ORF01560 


SAG1399 


444 


CBS domain protein 


ORF01561 


SAG1400 


188 


conserved hypothetical protein 


ORF01563 


SAG1401 


311 


conserved hypothetical protein TIGR01212 


ORF01564 


SAG1402 


213 


PAP2 family protein 


ORF01565 


SAG1403


194 


membrane protein, putative 


ORF01566 


SAG1404 


308 


cell wall surface anchor family protein


ORF01567 


SAG1405 


294 


sortase family protein 


ORF01568 


SAG1406 


293 


sortase family protein 


ORF01569 


SAG1407 


705 


cell wall surface anchor family protein 


ORF01570 


SAG1408 


901 


cell wall surface anchor family protein 


ORF01571 


SAG1409 


326 


transcriptional regulator, RofA family FRAMESHIFT 


ORF01572 


SAG1410 


379 


glycosyl transferase, group 1 family protein 


ORF01573 


SAG1411 


282 


exopolysaccharide biosynthesis protein, putative 


ORF01574 


SAG1412 


474 


exopolysaccharide biosynthesis protein, putative 


ORF01575 


SAG1413 


454 


hypothetical protein 


ORF01576 


SAG1414


308 


glycosyl transferase, group 2 family protein 


ORF01577 


SAG1415 


311 


glycosyl transferase, group 2 family protein 


ORF01578


SAG1416 


352 


dTDP-glucose 4,6-dehydratase, putative 


ORF01579 


SAG1417 


240 


4-diphosphocytidyl-2C-methyl-D-erythritol synthase, putative


ORF01580 


SAG1418 


259 


licD protein, putative 


ORF01581 


SAG1419


577 


hypothetical protein 


ORF01582 


SAG1420 


117 


conserved hypothetical protein 


ORF01583 


SAG1421 


243 


glycosyl transferase, group 2 family protein 


ORF01584 


SAG1422 


313 


glycosyl transferase, group 2 family protein 


ORF01585 


SAG1423 


384 


conserved hypothetical protein 


ORF01586 


SAG1424 


284 


dTDP-4-dehydrorhamnose reductase 


ORF01587 


SAG1425 


113 


conserved hypothetical protein 


ORF01589 


SAG1426 


369 


RNA polymerase sigma-70 factor 


ORF01590 


SAG1427 


602 


DNA primase 


ORF01591 


SAG1428 


125 


large conductance mechanosensitive channel protein 


ORF01592 


SAG1429 


58 


ribosomal protein S21


ORF01593 


SAG1430 


167 


conserved hypothetical protein 


ORF01594 


SAG1431 


268 


amino acid ABC transporter, amino acid-binding protein


ORF01596 


SAG1432 


347 


ammonium transporter family protein 


ORF01597 


SAG1433 


375 


conserved hypothetical protein 


ORF01598 


SAG1434 


328 


rhodanese family protein 


ORF01599 


SAG1435 


101 


conserved hypothetical protein 


ORF01600 


SAG1436 


457 


glycerol-3-phosphate transporter, putative


ORF01601


SAG1437 


55 


hypothetical protein 


ORF01602 


SAG1438 


754 


glycogen phosphorylase 


ORF01603 


SAG1439 


498 


4-alpha-glucanotransferase


ORF01604 


SAG1440 


342 


maltose operon repressor MalR, putative


ORF01605 


SAG1441 


415 


maltose/maltodextrin ABC transporter, maltose/maltodextrin-binding protein


ORF01606 


SAG1442 


456 


maltose ABC transporter, permease protein 


ORF01607 


SAG1443 


278 


maltose ABC transporter, permease protein 


ORF01608 


SAG1444 


490 


proton/peptide symporter family protein 


ORF01610 


SAG1445 




MutT/nudix family protein, FRAMESHIFT 


ORF01611 


SAG1446 


62 


hypothetical protein 


ORF01612


SAG1447 


441 


conserved hypothetical protein 


ORF01613


SAG1448 


502 


glycosyl transferase, group 1 family protein 


ORF01614 


SAG1449 


795 


preprotein translocase SecA subunit, putative 


ORF01615 


SAG1450 


330 


conserved domain protein 


ORF01617 


SAG1451 


494 


conserved hypothetical protein 


ORF01618 


SAG1452 


514 


conserved hypothetical protein 


ORF01619 


SAG1453


409 


preprotein translocase SecY family protein 


ORF01621 


SAG1454 


398 


conserved hypothetical protein 


ORF01622 


SAG1455 


295 


glycosyl transferase, group 2 family protein 


ORF01623 


SAG1456 


312 


glycosyl transferase, family 8, degenerate 


ORF01624 


SAG1457 


129 


IS1381, transposase OrfB 


ORF01625 


SAG1458 


127 


IS1381, transposase OrfA


ORF01626 


SAG1459 


413 


glycosyl transferase family 8 


ORF01627 


SAG1460 


401 


glycosyl transferase, family 8 


ORF01628 


SAG1461 


335 


conserved hypothetical protein 


ORF01630 


SAG1462 


970 


cell wall surface anchor family protein 


ORF01632 


SAG1463 




transcriptional regulator, RofA family POINT MUTATION


ORF01634 


SAG1464 


663 


excinuclease ABC, B subunit 


ORF01635 


SAG1465 


306 


protease, putative 


ORF01636 


SAG1466 


727 


glutamine ABC transporter, glutamine-binding protein/permease protein, putative


ORF01637 


SAG1467 


246 


glutamine ABC transporter, ATP-binding protein, GlnQ putative


ORF01638 


SAG1468 


116 


conserved hypothetical protein 


ORF01639 


SAG1469 


52 


conserved hypothetical protein 


ORF01640 


SAG1470


437 


GTP-binding protein, GTP1/Obg family 


ORF01641 


SAG1471 


42 


conserved hypothetical protein 


ORF01643 


SAG1472


413 


aminopeptidase PepS 


ORF01645 


SAG1473 


192 


cell wall surface anchor family protein 


ORF01646 


SAG1474 


680 


amidase family protein


ORF01647 


SAG1475


240 


ribosomal small subunit pseudouridine synthase A 


ORF01648 


SAG1476


280 


oxidoreductase, aldo/keto reductase family 


ORF01650 


SAG1477 


224 


nitroreductase family protein 


ORF01651 


SAG1478


130 


lactoylglutathione lyase 


ORF01652 


SAG1479


308


glycosyl transferase, group 2 family protein 


ORF01653 


SAG1480 


462 


amino acid permease 


ORF01654 


SAG1481 


155 


SsrA-binding protein 


ORF01655 


SAG1482 


801 


exoribonuclease, VacB/Rnb family


ORF01657 


SAG1483 


78 


preprotein translocase, SecG subunit 


ORF01658 


SAG1485


389 


multi-drug resistance protein 


ORF01660 


SAG1486 


548 


hypothetical protein 


ORF01661 


SAG1487 


233 


ABC transporter, ATP-binding protein


ORF01662


SAG1488


195 


dephospho-CoA kinase 


ORF01663


SAG1489


273 


formamidopyrimidine-DNA glycosylase


ORF01665 


SAG1490


282 


transcriptional regulator, MutR family 


ORF01666 


SAG1491 


530 


hypothetical protein 


ORF01667


SAG1492 


58 


hypothetical protein 


ORF01668 


SAG1493




hypothetical protein 


ORF01670 


SAG1494 


32 


hypothetical protein 


ORF01672 


SAG1495 


81 


protease, putative, POINT MUTATION 


ORF01673 


SAG1496


110 


hypothetical protein 


ORF01674 


SAG1497 


37 


hypothetical protein 


ORF01675 


SAG1498 


133 


hypothetical protein 


ORF01677 


SAG1499 


299 


GTP-binding protein Era 


ORF01678 


SAG1500 


132 


diacylglycerol kinase 


ORF01679 


SAG1501 


161 


conserved hypothetical protein TIGR00043


ORF01680 


SAG1502 


268 


tetracenomycin polyketide synthesis O-methyltransferase TcmP, putative


ORF01681 


SAG1503 


39 1 


hypothetical protein 


ORF01682 


SAG1504 


38 


hypothetical protein 


ORF01683 


SAG1505 


158 


MutT/nudix family protein 


ORF01684 


SAG1506 


267 


hypothetical protein 


ORF01685 


SAG1507 


345 


PhoH family protein 


ORF01686 


SAG1508 


590 


67 kDa Myosin-crossreactive streptococcal antigen 


ORF01687 


SAG1509 


71 


conserved hypothetical protein 


ORF01688 


SAG1510 


169 


peptide methionine sulfoxide reductase 


ORF01689 


SAG1511 


284 


conserved hypothetical protein 


ORF01690 


SAG1512 


185 


ribosome recycling factor 


ORF01691 


SAG1513 


242 


uridylate kinase 


ORF01692 


SAG1514 


226 


peptide ABC transporter, ATP-binding protein 


ORF01693 


SAG1515 


262 


peptide ABC transporter, ATP-binding protein 


ORF01694 


SAG1516 


255 


peptide ABC transporter, permease protein 


ORF01695 


SAG1517 


314 


peptide ABC transporter, permease protein 


ORF01696 


SAG1518


525 


peptide ABC transporter, peptide-binding protein


ORF01697 


SAG1519 


229 


ribosomal protein L1 


ORF01698 


SAG1520 


141 


ribosomal protein L11


ORF01699 


SAG1521


388 


transposase, IS30 family, putative


ORF01700 


SAG1522 


460 


transporter, major facilitator family 


ORF01702 


SAG1523 


404 


peptidase, M20/M25/M40 family


ORF01703 


SAG1524 


294 


transcriptional regulator, LysR family 


ORF01704 


SAG1525 


117 


conserved hypothetical protein 


ORF01705 


SAG1526 


178 


IS861, transposase OrfA


ORF01706 


SAG1527 


277 


IS861, transposase OrfB 


ORF01707 


SAG1528 


571 


chorismate binding enzyme 


ORF01708 


SAG1529 


785 


FtsK/SpoIIIE family protein


ORF01709 


SAG1530


267 


peptidyl-prolyl cis-trans isomerase, cyclophilin-type


ORF01710 


SAG1531 


277 


manganese ABC transporter, permease protein 


ORF01711 


SAG1532 


238 


manganese ABC transporter, ATP-binding protein 


ORF01712 


SAG1533 


308 


manganese ABC transporter, manganese-binding adhesion lipoprotein


ORF01713 


SAG1534 


215 


iron-dependent transcriptional regulator 


ORF01714 


SAG1535 


229 


5-methylthioadenosine nucleosidase/S-adenosylhomocysteine nucleosidase


ORF01715 


SAG1536


89 


conserved hypothetical protein 


ORF01716 


SAG1537


184 


MutT/nudix family protein


ORF01718


SAG1538


459 


UDP-N-acetylglucosamine pyrophosphorylase 


ORF01719 


SAG1539


31 


hypothetical protein 




SAG1540


137 


conserved hypothetical protein 


ORF01721 


SAG1541


125 


glyoxalase family protein 


ORF01722


SAG1542


318 


oxidoreductase, Gfo/Idh/MocA family


ORF01724 


SAG1543




conserved hypothetical protein, FRAMESHIFT 


ORF01725 


SAG1544


232 


gluconate 5-dehydrogenase, putative 


ORF01726 


SAG1545


78 


conserved hypothetical protein 


ORF01727 


SAG1546


82 


conserved hypothetical protein 


ORF01729 


SAG1547


166 


acetyltransferase, GNAT family 


ORF01730


SAG1548 


422 


glycosyl transferase, group 2 family protein 


ORF01731 


SAG1549 


127 


IS1381, transposase OrfA


ORF01732 


SAG1550 


129 


IS1381, transposase OrfB


ORF01733 


SAG1551 


67 


hypothetical protein 


ORF01734


SAG1552


719 


conserved hypothetical protein 


ORF01735 


SAG1553


477 


hypothetical protein 


ORF01736 


SAG1554 


225 


hypothetical protein 


ORF01737 


SAG1555


231 


hypothetical protein 


ORF01738 


SAG1556 


445 


branched-chain amino acid transport system II carrier protein


ORF01739 


SAG1557 


665 


methionyl-tRNA synthetase 


ORF01740 


SAG1558


291 


tellurite resistance protein TehB 


ORF01741 


SAG1559 


231 


membrane protein, putative 


ORF01742 


SAG1560 


40 


hypothetical protein 


ORF01743 


SAG1561 


405 


PTS system component, putative


ORF01744 


SAG1562 


280 


conserved hypothetical protein 


ORF01745 


SAG1563 


275 


exodeoxyribonuclease 


ORF01746 


SAG1564 


118 


conserved hypothetical protein 


ORF01747 


SAG1565 


158 


methylated-DNA--protein-cysteine S-methyltransferase


ORF01748 


SAG1566 


393 


D-isomer specific 2-hydroxyacid dehydrogenase family protein


ORF01749 


SAG1567


182 


acetyltransferase, GNAT family 


ORF01750


SAG1568




phosphoserine aminotransferase FRAMESHIFT


ORF01752


SAG1569


Zl 1 


copper homeostasis protein CutC, putative 


ORF01753 


SAG1570


34 


conserved hypothetical protein 


ORF01754 


SAG1571


53 


hypothetical protein 


ORF01755 


SAG1572 


287 


tetrapyrrole methylase family protein 


ORF01756 


SAG1573 


108 


conserved hypothetical protein 


ORF01758 


SAG1574 


287 


DNA polymerase III, delta prime subunit, putative


ORF01759 


SAG1575 


211 


thymidylate kinase 


ORF01761 


SAG1576 


267 


transposase, IS30 family, putative, truncation 


ORF01763 


SAG1577 


219 


AcuB family protein 


ORF01764 


SAG1578 


236 


branched-chain amino acid ABC transporter, ATP-binding protein


ORF01765 


SAG1579 


254 


branched-chain amino acid ABC transporter, ATP-binding protein


ORF01766 


SAG1580 


317 


branched-chain amino acid ABC transporter, permease protein


ORF01767 


SAG1581 


289 


branched-chain amino acid ABC transporter, permease protein


ORF01769 


SAG1582 


388 


branched-chain amino acid ABC transporter, amino acid-binding protein


ORF01770 


SAG1583 


81 


conserved hypothetical protein 


ORF01772 


SAG1584 


377 


IS1548, transposase


ORF01773


SAG1585 


196 


ATP-dependent Clp protease, proteolytic subunit ClpP


ORF01774 


SAG1586


20» 


uracil phosphoribosyltransferase


ORF01775 


SAG1587


ooy 


aminotransferase, class I


ORF01777 


SAG1588


A QO 


DMA mothx/ltraneforQe/a TrmW ^amilv/ n ro 1 1 rv 9 
r\lNM lIifcJLIiyillalloltJIaSe, 1 lIFin Idiliiiy, Ljiuup 


ORF01778 


SAG1589 


45U 


amino acid permease, putative


ORF01779


SAG1590


449


potassium uptake protein, Trk family


ORF01780


SAG1591


4/t) 


cation uptake protein, Trk family


ORF01781 


SAG1592


83 


conserved hypothetical protein TIGR00278 


ORF01782 


SAG1593


240 


ribosomal large subunit pseudouridine synthase B


ORF01783 






conserved hypothetical protein TIGR00281


ORF01784 


SAG1595


ZOO 


uncharacterized ACR, COG1354


ORF01785


SAG1596




integrase/recombinase, phage integrase family


ORF01786


SAG1597


157 


CBS domain protein 


ORF01787


SAG1598


173 


conserved hypothetical protein 


ORF01788


SAG1599


324 


HAM1 protein 


ORF01789


SAG1600


264 


glutamate racemase 


ORF01790




79 


conserved hypothetical protein 


ORF01791 


SAG1602 


180 


membrane protein, putative 


ORF01792 


SAG1603


A 7<J 
I/O 


transcriptional regulator, biotin repressor family


ORF01793 


SAG1604




membrane protein, putative


ORF01794 


SAG1605


^07 

Id/ 


conserved hypothetical protein


ORF01795 


SAG1606 


247 


RNA methyltransferase, TrmH family


ORF01796 


SAG1607


92 


acylphosphatase 


ORF01797 


SAG1608


310 


membrane protein, putative 


ORF01799 


SAG1609


221 


amino acid ABC transporter, permease protein 


ORF01800 


SAG1610 


285 


amino acid ABC transporter, substrate-binding protein


ORF01801 


SAG1611


4o0 


amidase family protein


ORF01802 


SAG1612




transcription elongation factor


ORF01803 


SAG1613 


OUU 


uncnaracisnzcu doi\, i wjo \&\ i my wv-/vj iwo, puiauvc 


ORF01804


SAG1614


ID/ 


acetyltransferase, GNAT family


ORF01805


SAG1615




UDP-N-acetylmuramate--alanine ligase


ORF01806


SAG1616




conserved hypothetical protein


ORF01807


SAG1617


on 


hypothetical protein


ORF01808


SAG1618


mi? 


Snf2 family protein


ORF01810


SAG1619


077 

Of / 


i<^-i tran*?Dosase 


ORF01811


SAG1620


too 


phosphoglycerate dehydrogenase-related protein 






mn 


primosomal protein DnaI


ORF01813




0?7 1 


conserved hypothetical protein 






159 


conserved hypothetical protein TIGR00244 


ORF01815


SAG1624


ou 1 


sensor histidine kinase CsrS


ORF01816 


SAG1625


99Q 


DNA-binding response regulator CsrR


ORF01817 


SAG1626


A ~T~7 
All 


conserved hypothetical protein


ORF01818 


SAG1627 


296 


heat shock protein HtpX 


ORF01820 


SAG1628


184 


lemA protein 


ORF01821 


SAG1629 


237 


glucose-inhibited division protein B 


ORF01822 


SAG1630 


459 


sodium transport family protein 


ORF01823 


SAG1631 


223 


potassium uptake protein, Trk family, putative 


ORF01824 


SAG1632 


276 


cobalt transport family protein 


ORF01825 


SAG1633 


558 


ABC transporter, ATP-binding protein 


ORF01826 


SAG1634 


212


conserved hypothetical protein 


ORF01827 


SAG1635 


402


sodium:dicarboxylate symporter family protein


ORF01828


SAG1636 


455 


branched-chain amino acid transport system II carrier protein


ORF01829


SAG1637


351 


alcohol dehydrogenase, zinc-containing 


ORF01830 


SAG1638


230 


ABC transporter, permease protein 


ORF01831 


SAG1639 


356 


ABC transporter, ATP-binding protein 


ORF01832 


SAG1640


458 


peptidase, M20/M25/M40 family 


ORF01833 


SAG1641 


274 


lipoprotein, putative 


ORF01834 


SAG1642 


277 


ABC transporter, substrate-binding protein 


ORF01835 


SAG1643 


229 


glutamine amidotransferase, class I 


ORF01836 


SAG1644 


37 


hypothetical protein 


ORF01837 


SAG1645 


238 


conserved hypothetical protein TIGR01033 


ORF01838 


SAG1646 


32 


hypothetical protein 


ORF01839


SAG1647 


328 


dihydroxyacetone kinase family protein 


ORF01840 


SAG1648 


178 


transcriptional regulator, TetR family, putative 


ORF01842 


SAG1649 


37 


hypothetical protein 


ORF01843 


SAG1650


329 


dihydroxyacetone kinase family protein 


ORF01844 


SAG1651 


192 


dihydroxyacetone kinase family protein 


ORF01845 


SAG1652


124 


conserved hypothetical protein 


ORF01846 


SAG1653 


237 


glycerol uptake facilitator protein 


ORF01847 


SAG1654


134 


conserved hypothetical protein 


ORF01848 


SAG1655 


237 


transcriptional regulator, MerR family 


ORF01849 


SAG1656 


369 


conserved hypothetical protein 


ORF01850 


SAG1657 


83 


hypothetical protein 


ORF01851 


SAG1658 


244 


conserved hypothetical protein 


ORF01852 


SAG1659


118 


iojap-related protein 


ORF01853


SAG1660


173 


isochorismatase family protein


ORF01854 


SAG1661 


195 


conserved hypothetical protein TIGR00488 


ORF01855 


SAG1662 


210 


conserved hypothetical protein TIGR00482 


ORF01856 


SAG1663


105 


conserved hypothetical protein TIGR00253 


ORF01857


SAG1664 


372 


GTP-binding protein 


ORF01858 


SAG1665 


177 


hydrolase, haloacid dehalogenase-like family


ORF01859 


SAG1666 


295 


membrane protein 


ORF01860 


SAG1667 


480 


glutamyl-tRNA(Gln) amidotransferase, B subunit 


ORF01861 


SAG1668 


488 


glutamyl-tRNA(Gln) amidotransferase, A subunit 


ORF01862 


SAG1669 


100 


glutamyl-tRNA(Gln) amidotransferase, C subunit


ORF01863 


SAG1670 


881 


pyruvate phosphate dikinase 


ORF01864 


SAG1671 


276 


conserved hypothetical protein 


ORF01865 


SAG1672


170 


CBS domain protein 


ORF01866 


SAG1673 


377 


3-hydroxyacyl-CoA dehydrogenase family protein 


ORF01867 


SAG1674


182 


isochorismatase family protein 


ORF01869 


SAG1675 


261 


transcriptional regulator CodY, putative 


ORF01870 


SAG1676


403 


aminotransferase, class I 


ORF01871 


SAG1677 


137 


universal stress protein family FRAMESHIFT 


ORF01872 


SAG1678 


460 


hydrolase, haloacid dehalogenase-like family 


ORF01873 


SAG1679 


320 


asparaginase family protein 


ORF01874 


SAG1680 


292 


shikimate 5-dehydrogenase 


ORF01875 


SAG1681


304 


oxidoreductase, aldo/keto reductase family 


ORF01876 


SAG1682 


671 


ATP-dependent DNA helicase RecG 


ORF01877 


SAG1683 


512


immunogenic secreted protein, putative 


ORF01878 


SAG1684 


366 


alanine racemase 


ORF01879 


SAG1685


119 


holo-(acyl-carrier-protein) synthase 


ORF01880 


SAG1686 


335 


phospho-2-dehydro-3-deoxyheptonate aldolase 


ORF01881 


SAG1687 


842 


preprotein translocase, SecA subunit 


ORF01882 


SAG1688 


315 


mannose-6-phosphate isomerase, class I 


ORF01883 


SAG1689 


293 


fructokinase


OKrUlooo


SAG1690


639 


PTS system, IIABC components


ORF01886


SAG1691


479 


sucrose-6-phosphate hydrolase 


ORF01887


SAG1692


320 


sucrose operon repressor ScrR 


ORF01888


SAG1693


AAA 


N utilization substance protein B 


ORF01889


SAG1694


129


conserved hypothetical protein 


ORF01890


SAG1695


186 


translation elongation factor P 


ORF01892


SAG1696


38 


hypothetical protein 


ORF01893


SAG1697


48 


hypothetical protein 


ORF01894


SAG1698


99 


conserved hypothetical protein 


ORF01895 


SAG1699


30 


hypothetical protein 


ORF01896 


SAG1700 


76 


hypothetical protein


ORF01897 


SAG1701 


56 


hypothetical protein 


ORF01898 


SAG1702


41 


hypothetical protein 


ORF01899 


SAG1703


54 


hypothetical protein 


ORF01900 


SAG1704 


150 


cytidine/deoxycytidylate deaminase family protein 


ORF01902 


SAG1705




peptidase, M24 family POINT MUTATION 


ORF01903 


SAG1706


238 


conserved hypothetical protein 


ORF01904 


SAG1707


499 


drug resistance transporter, EmrB/QacA family 


ORF01905 


SAG1708


38 


hypothetical protein 


ORF01906 


SAG1709


942 


excinuclease ABC, A subunit 


ORF01907 


SAG1710


223 


conserved hypothetical protein 


ORF01908 


SAG1711


314 


magnesium transporter, CorA family 


ORF01909 


SAG1712 


79 


ribosomal protein S18 


ORF01910


SAG1713


163 


single-strand binding protein 


ORF01911


SAG1714


95 


ribosomal protein S6 


ORF01912


SAG1715 


374 


A/G-specific adenine glycosylase 


ORF01913


SAG1716


197 


transcriptional regulator, Cro/CI family 


ORF01914


SAG1717


104


thioredoxin 


ORF01915


SAG1718


166


PAP2 family protein 


ORF01916 


SAG1719


779 


MutS2 family protein 


ORF01917


SAG1720


180 


conserved hypothetical protein 


ORF01918 


SAG1721


103 


conserved hypothetical protein 


ORF01919 


SAG1722


297 


ribonuclease HIII


ORF01920


SAG1723


197 


signal peptidase I 


ORF01921 


SAG1724


806 


helicase, putative 


ORF01922 


SAG1725


160 


conserved hypothetical protein 


ORF01923


SAG1726


364 


DNA-damage inducible protein P 


ORF01924


SAG1727


770 


formate acetyltransferase 


ORF01925


SAG1728


124 


FMN-binding protein


ORF01926


SAG1729


309 


conserved hypothetical protein 


ORF01927


SAG1730


251 


proteinase, putative, degenerate, FRAMESHIFT 


ORF01928 


SAG1731


298 


membrane protein, putative


ORF01929


SAG1732


282 


glycerol uptake facilitator protein, putative 


ORF01930 


SAG1733


150 


universal stress protein family 


ORF01931 


SAG1734


400 


transporter, putative 


ORF01932 


SAG1735 


219 


transcriptional regulator, Crp/Fnr family 


ORF01933 


SAG1736


761 


X-pro dipeptidyl-peptidase 


ORF01934 


SAG1737 


119 


hypothetical protein 


ORF01936 


SAG1738 


326 


polyprenyl synthetase family protein 


ORF01937 


SAG1739


582 


ABC transporter, ATP-binding protein CydC 


ORF01938 


SAG1740 


572 


ABC transporter, ATP-binding protein CydD 


ORF01939 


SAG1741 


339 


cytochrome d ubiquinol oxidase, subunit II


ORF01940 


SAG1742 


475 


cytochrome d oxidase, subunit I 


ORF01941 


SAG1743 


402 


pyridine nucleotide-disulphide oxidoreductase family protein


ORF01942


SAG1744 


299 


prenyltransferase, UbiA family 


ORF01943 


SAG1745


148 


hypothetical protein


ORF01944 


SAG1746


35 


hypothetical protein 


ORF01945 


SAG1747 


99 


conserved hypothetical protein TIGR00103 


ORF01946 


SAG1748 


396 


cyclopropane-fatty-acyl-phospholipid synthase 


ORF01947 


SAG1749 


241 


transcriptional regulator, MerR family


ORF01948 


SAG1750 


195 


exonuclease i 


ORF01949 


SAG1751 


178 


conserved hypothetical protein 


ORF01950 


SAG1752 


375 


conserved hypothetical protein TIGR00275 


ORF01951 


SAG1753 


260 


conserved hypothetical protein 


ORF01952 


SAG1754 


89 


ribosomal protein S14 


ORF01953 


SAG1755 


38 


hypothetical protein 


ORF01954 


SAG1756 


341 


conserved hypothetical protein 


ORF01957 


SAG 1757 


336 


O-sialoglycoprotein endopeptidase family protein 


ORF01958 


SAG1758 


135 


ribosomal-protein-alanine acetyltransferase, putative 


ORF01960 


SAG1759


230 


glycoprotease family protein, putative 


ORF01961 


SAG 1760 


76 


conserved hypothetical protein 


ORF01962 


SAG 1761 


559 


metallo-beta-lactamase superfamily protein


ORF01963 


SAG 1762 


169 


conserved hypothetical protein 


ORF01964 


SAG 1763 


448 


glutamine synthetase, type I


ORF01965 


SAG 1764 


123 


transcriptional regulator GlnR 


ORF01967 


SAG 1765 


179 


conserved hypothetical protein 


ORF01969 


SAG 1766 


398 


phosphoglycerate kinase 


ORF01970 


SAG 1767 


289 


acid phosphatase 


ORF01971 


SAG 1768 


336 


glyceraldehyde 3-phosphate dehydrogenase 


ORF01972 


SAG 1769 


692 


translation elongation factor G 


ORF01973 


SAG1770


156 


ribosomal protein S7 


ORF01974 


SAG 1771 


137 


ribosomal protein S12 


ORF01975 


SAG1772


270 


pur operon repressor 


ORF01976 


SAG 1773 


313 


HD domain protein 


ORF01977 


SAG 1774 


424 


conserved hypothetical protein 


ORF01978 


SAG 1775 


210 


conserved hypothetical protein 


ORF01979 


SAG 1776 


220 


ribulose-phosphate 3-epimerase 


ORF01980 


SAG 1777 


290 


conserved hypothetical protein TIGR00157 


ORF01981 


SAG 1778 


283 


rRNA (guanine-N1-)-methyltransferase, putative


ORF01983 


SAG 1779 


290 


dimethyladenosine transferase 


ORF01984 


SAG 1780 


163 


hypothetical protein 


ORF01985 


SAG 1781 


186 


primase-related protein 


ORF01987 


SAG 1782 


260 


deoxyribonuclease, TatD family 


ORF01988 


SAG 1783 


90 


hypothetical protein 


ORF01989 


SAG 1784 


130 


hypothetical protein 


ORF01990 


SAG 1785 


430 


hypothetical protein 


ORF01991 


SAG 1786 


130 


hypothetical protein 


ORF01992 


SAG1787 


420 


dltD protein 


ORF01993 


SAG1788 


79 


D-alanyl carrier protein 


ORF01994 


SAG 1789 


4^1 


dltB protein 


ORF01996 


SAG1790 


511 


D-alanine-activating enzyme 


ORF01997 


SAG1791 


395 


sensor histidine kinase 


ORF01998 


SAG1792 


224 


DNA-binding response regulator 


ORF01999 


SAG1793 


44 


ribosomal protein L34 


ORF02000 


SAG1794 


451 


membrane protein, putative 


ORF02001 


SAG1795 


388 


transposase, IS30 family, putative 


ORF02002 


SAG1796 


575 


amino acid ABC transporter, permease protein 


ORF02004 


SAG1797 


407 


amino acid ABC transporter, ATP-binding protein


ORF02005


SAG1798 


39 


hypothetical protein 


ORF02006 


SAG1799 


792 


xylulose-5-phosphate/fructose-6-phosphate
phosphoketolase 


ORF02007 


SAG1800 


363 


conserved hypothetical protein 


ORF02008 


SAG1801 


559 


transcriptional antiterminator, BglG family


ORF02009 


SAG1802 


253 


conserved hypothetical protein 


ORF02010 


SAG1803 


505 


carbohydrate kinase, FGGY family 


ORF02011 


SAG1804 


329 


hypothetical protein 


ORF02012 


SAG1805 


483


PTS system component, putative 


ORF02015 


SAG1806 


318 


glyoxylate reductase, NADH-dependent 


ORF02016 


SAG1807 


339 


hypothetical protein 


ORF02017 


SAG1808 


327 


sugar binding transcriptional regulator, LacI family


ORF02018 


SAG 1809 


215 


transaldolase family protein 


ORF02019 


SAG1810 


238 


carbohydrate isomerase, AraD/FucA family 


ORF02020 


SAG1811 


287 


hexulose-6-phosphate isomerase, putative 


ORF02021 


SAG1812 


221 


hexulose-6-phosphate synthase, putative


ORF02022 


SAG1813 


161 


PTS system, IIA component 


ORF02023 


SAG1814 


92 


PTS system, IIB component


ORF02024 


SAG1815 


479 


transport protein SgaT, putative 


ORF02025 


SAG1816 


205 


hypothetical protein 


ORF02026 


SAG1817 


157 


hypothetical protein 


ORF02027 


SAG1818 


430 


adenylosuccinate synthetase 


ORF02028


SAG1819 


340 


perfringolysin O regulator protein 


ORF02029 


SAG1820 


224 


conserved hypothetical protein 


ORF02030 


SAG1821 


750 


glutamate-cysteine ligase-related protein


ORF02031 


SAG1822 


272 


conserved hypothetical protein 


ORF02032 


SAG1823 


418 


conserved hypothetical protein 


ORF02033 


SAG1824 


291 


chaperonin, 33 kDa 


ORF02034


SAG1825 


325 


NifR3/Smm1 family protein 


ORF02035


SAG1826 


213 


deoxynucleoside kinase family protein 


ORF02036 


SAG1827 


163 


phosphinothricin N-acetyltransferase 


ORF02037 


SAG1828 


815 


ATP-dependent Clp protease, ATP-binding subunit 


ORF02038 


SAG 1829 


154 


transcriptional regulator CtsR 


ORF02039 


SAG 1830 


153 


conserved hypothetical protein 


ORF02040 


SAG1831 


346 


translation elongation factor Ts 


ORF02041 


SAG1832


256 


ribosomal protein S2


ORF02042 


SAG 1833 


186 


alkyl hydroperoxide reductase, subunit C


ORF02043 


SAG 1834 


rr a r\ 


alkyl hydroperoxide reductase, subunit F


ORF02044


SAG1835


1o4 


conserved hypothetical protein 


ORF02045


SAG1836


OJ 


conserved hypothetical protein 


ORF02046


SAG1837


4bo 


lysin, putative 


ORF02047 


SAG 1838 


109 


holin, putative 


ORF02048 


SAG1839 


136 


conserved hypothetical protein 


ORF02049 


SAG1840 


112 


hypothetical protein 


ORF02050 


SAG1841


76 


conserved domain protein 


ORF02051 


SAG1842 


1224 


PblB, putative


ORF02053 


SAG1843 


240 


conserved hypothetical protein 


ORF02056 


SAG1844 


911 


conserved hypothetical protein 


ORF02057 


SAG1845 


42 


hypothetical protein 


ORF02058 


SAG1846 


158 


hypothetical protein 


ORF02059 


SAG 1847 


227 


conserved hypothetical protein 


ORF02060 


SAG1848 


114 


conserved hypothetical protein 


ORF02061 


SAG1849 


115 


hypothetical protein


ORF02062


SAG1850 


101 


hypothetical protein 


ORF02063 


SAG1851 


111 


conserved domain protein 


ORF02064 


SAG1852 


420 


conserved domain protein 


ORF02066 


SAG1853


180 


protease, putative 


ORF02067 


SAG1854 


380 


conserved hypothetical protein 


ORF02068 


SAG1855 


570 


terminase large subunit, putative 


ORF02069 


SAG1856 


161 


hypothetical protein 


ORF02070 


SAG1858 


95 


hypothetical protein 


ORF02071 


SAG1859 


180 


site-specific recombinase, phage integrase family 


ORF02072 


SAG 1860 


154 


conserved hypothetical protein 


ORF02073 


SAG1861 


119 


transcriptional regulator, Cro/CI family 


ORF02075 


SAG1862 


86 


hypothetical protein 


ORF02076 


SAG1863 


138 


single-strand binding protein 


ORF02077 


SAG1864 


68 


hypothetical protein 


ORF02078 


SAG1865 


74 


conserved hypothetical protein 


ORF02079 


SAG1866 


109 


conserved hypothetical protein 


ORF02080 


SAG1867 


163 


conserved hypothetical protein 


ORF02081 


SAG1868 


134 


hypothetical protein 


ORF02082 


SAG1869 


437 


type II DNA modification methyltransferase, putative 


ORF02083 


SAG 1870 


273 


DNA replication protein DnaC, putative 


ORF02084 


SAG 1871 


248 


conserved hypothetical protein 


ORF02085 


SAG 1872 


200 


hypothetical protein 


ORF02086 


SAG 1873 


443 


replicative DNA helicase 


ORF02087 


SAG 1874 


87 


hypothetical protein 


ORF02088 


SAG 1875 


94 


conserved hypothetical protein


ORF02089 


SAG1876 


176 


HNH endonuclease family protein 


ORF02090 


SAG 1877 


236 


antirepressor protein, putative 


ORF02091 


SAG1878 


102 


conserved domain protein 


ORF02092


SAG 1879 


156 


hypothetical protein 


ORF02093 


SAG 1880 


54 


hypothetical protein 


ORF02094 


SAG1881 


51 


hypothetical protein 


ORF02095 


SAG1882 


120 


repressor protein, putative 


ORF02097 


SAG1884 


134 


hypothetical protein


ORF02098 


SAG1885 


356 


site-specific recombinase, phage integrase family 


ORF02100 


SAG1886


32 


hypothetical protein 


ORF02101 


SAG 1887 


con 

boy 


Na+/H+ exchanger family protein 


ORF02102 


SAG 1888 


*70 

to 


hypothetical protein 


ORF02103 


SAG 1889 


ol 1 


microcin immunity protein MccF, putative 


ORF02104 


SAG1890


OOl 


endopeptidase O 


ORF02105 


SAG 1891 


007 


oxidoreductase, Gfo/Idh/MocA family


ORF02107


SAG1892


358 


membrane protein, putative 


ORF02108 


SAG1893 


59 


hypothetical protein 


ORF02109 


SAG1894 


214 


Cyclic nucleotide-binding domain protein 


ORF02110


SAG1895




polypeptide deformylase


ORF02111 


SAG1896 


333 


sugar binding transcriptional regulator RegR 


ORF02112 


SAG1897 


634 


conserved hypothetical protein ; 


ORF02113 


SAG1898 


271 


PTS system, IID component 


ORF02114 


SAG1899 


288 


PTS system, IIC component


ORF02115 


SAG1900 


164 


PTS system, IIB component 


ORF02116 


SAG1901 


398 


glucuronyl hydrolase 


ORF02118 


SAG1902 


144 


PTS system, IIA component


ORF02119 


SAG1903 


34 


hypothetical protein 






WO 2004/018646 



PCT/US2003/026827 



Table 32: Conversion of ORF Ref Nos. with SAG Ref Nos. 



ORF Ref No. 


SAGxxxx Ref No. 


aa 


Annotation 


ORF02120 


SAG1904


270 


oxidoreductase, short-chain dehydrogenase/reductase
family 


ORF02121 


SAG1905 


212 


conserved hypothetical protein 


ORF02122 


SAG1906 


335 


carbohydrate kinase, PfkB family


ORF02123 


SAG1907 


212 


2-dehydro-3-deoxyphosphogluconate aldolase/4-hydroxy-2-oxoglutarate aldolase


ORF02124 


SAG1908 


499 


hypothetical protein 


ORF02125 


SAG1909 


204 


nitroreductase family protein 


ORF02126 


SAG1910 


141 


transcriptional regulator, MarR family 


ORF02127 


SAG1911 


1468 


DNA polymerase III, alpha subunit, Gram-positive type 


ORF02128 


SAG1912 


194 


N-acetylmuramoyl-L-alanine amidase, family 4 protein 


ORF02129 


SAG1913 


617 


prolyl-tRNA synthetase


ORF02130 


SAG1914 


419 


membrane-associated zinc metalloprotease, putative 


ORF02131 


SAG1915 


264 


phosphatidate cytidylyltransferase 


ORF02132 


SAG1916 


250 


undecaprenyl diphosphate synthase 


ORF02133 


SAG1917 


113 


preprotein translocase, YajC subunit 


ORF02134 


SAG1918 


114 


conserved hypothetical protein 


ORF02135 


SAG1919 


387 


malate oxidoreductase 


ORF02136 


SAG 1920 


445 


citrate carrier protein, CCS family 


ORF02137 


SAG1921 


508 


sensor histidine kinase family protein 


ORF02138 


SAG1922 


229 


response regulator 


ORF02139 


SAG 1923 


331 


UDP-glucose 4-epimerase 


ORF02140 


SAG 1924 


535 


glucan 1,6-alpha-glucosidase


ORF02141 


SAG 1925 


377 


sugar ABC transporter, ATP-binding protein 


ORF02142 


SAG1926


283 


helix-turn-helix domain protein, fis-type 


ORF02143 


SAG 1927 


298 


lacX protein 


ORF02144 


SAG 1928 


325 


tagatose 1,6-diphosphate aldolase


ORF02145 


SAG1929 


310 


tagatose-6-phosphate kinase 


ORF02146 


SAG1930 


171 


galactose-6-phosphate isomerase, LacB subunit 


ORF02147 


SAG1931 


141 


galactose-6-phosphate isomerase, LacA subunit 


ORF02148 


SAG1932 


816 


neuraminidase 


ORF02149 


SAG1933 


482 


PTS system, IIC component, putative


ORF02150 


SAG1934 


101 


PTS system, IIB component, putative


ORF02152 


SAG1935 


157 


PTS system, IIA component, putative


ORF02153 


SAG1936 


258 


lactose phosphotransferase system repressor 


ORF02156 


SAG1937 




streptococcal histidine triad family protein, degenerate, 
FRAMESHIFT 


ORF02157 


SAG1938 


307 


adhesion lipoprotein, putative 


ORF02158 


SAG1939


147 


conserved hypothetical protein TIGR00256 


ORF02159 


SAG1940


738 


GTP pyrophosphokinase 


ORF02160 


SAG1941 


800 


2',3'-cyclic-nucleotide 2'-phosphodiesterase


ORF02161 


SAG1942 


151 


nrdI protein, putative


ORF02162 


SAG1943 


345 


conserved hypothetical protein 


ORF02163 


SAG1944 


165 


conserved hypothetical protein 


ORF02164 


SAG 1945 


345 


iron ABC transporter, iron-binding protein 


ORF02165 


SAG1946 


257 


DNA-binding response regulator 


ORF02166 


SAG1947 


549 


conserved hypothetical protein 


ORF02167 


SAG1948 


275 


PTS system, IID component


ORF02168 


SAG1949 


269 


PTS system, IIC component 


ORF02169 


SAG1950 


163 


PTS system, IIB component


ORF02170 


SAG1951 


141 


PTS system, IIA component, putative 


ORF02171 


SAG1952 


353 


membrane protein, putative 


ORF02172 


SAG1953 


60 


hypothetical protein 



ORF02173 


SAG1954 


384 


hypothetical protein 


ORF02174 


SAG 1955 


282 


ABC transporter, ATP-binding protein 


ORF02175


SAG 1956 


96 


conserved domain protein 


ORF02176 


SAG1957 


250 


response regulator 


ORF02177 


SAG1958 


276 


conserved hypothetical protein 


ORF02178 


SAG1959 


727 


PTS system, IIABC components 


ORF02179 


SAG1960 


551 


sensor histidine kinase 


ORF02180 


SAG1961 


225 


phosphate regulon response regulator PhoB 


ORF02181 


SAG1962 


218 


phosphate transport system regulatory protein PhoU, 
putative 


ORF02182 


SAG1963 


253 


phosphate ABC transporter, ATP-binding protein 


ORF02183 


SAG1964 


292 


phosphate ABC transporter, permease protein 


ORF02184 


SAG1965


281 


phosphate ABC transporter, permease protein 


ORF02186 


SAG1966 


293 


hemolysin precursor, putative 


ORF02187 


SAG1967 


195 


hypothetical protein 


ORF02188 


SAG1968 


246 


conserved hypothetical protein TIGR00046 


ORF02189 


SAG 1969 


317 


ribosomal protein L11 methyltransferase


ORF02190 


SAG 1970 


102 


conserved hypothetical protein 


ORF02191 


SAG1971 


41 


hypothetical protein 


ORF02192


SAG 1972 


238 


transcriptional regulator, MerR family 


ORF02194 


SAG1973 


156 


acetyltransferase, GNAT family 


ORF02195


SAG 1974 


152 


MutT/nudix family protein 


ORF02196 


SAG1975 


47 


hypothetical protein 


ORF02197 


SAG1976 


156 


conserved hypothetical protein 


ORF02198 


SAG 1977 


163 


acetyltransferase, GNAT family 


ORF02199 


SAG1978 


422 


ATPase, AAA family 


ORF02201 


SAG1979 


253 


hypothetical protein 


ORF02202 


SAG1980 


300 


ABC transporter, ATP-binding protein 


ORF02203 


SAG 1981 


68 


hypothetical protein 


ORF02205 


SAG1982 


359 


transcriptional regulator, Cro/CI family 


ORF02206 


SAG1983 


105 


conserved hypothetical protein 


ORF02207 


SAG1984 


188 


conserved hypothetical protein TIGR00730


ORF02208 


SAG1985 


51 


hypothetical protein 


ORF02209 


SAG1986 


375 


integrase, phage family, putative 


ORF02210 


SAG1987 


61 


conserved hypothetical protein 


ORF02211


SAG1988 


342 


conserved hypothetical protein 


ORF02212 


SAG1989 


139 


hypothetical protein 


ORF02213 


SAG1990 


127 


hypothetical protein 


ORF02214 


SAG1991 


204 


transcriptional regulator, Cro/CI family 


ORF02215 


SAG1992 


518 


conserved hypothetical protein 


ORF02216 


SAG1993 


373 


site-specific recombinase, phage integrase family 


ORF02217 


SAG 1994 


108 


conserved hypothetical protein 


ORF02219 


SAG 1995 


210 


hypothetical protein 


ORF02221 


SAG1996 


263 


cell wall anchor protein-related protein 


ORF02223 


SAG1997 


182 


hypothetical protein 


ORF02224 


SAG 1998 


457 


hypothetical protein 


ORF02225 


SAG1999 


47 


hypothetical protein 


ORF02226 


SAG2000 


666 


membrane protein, putative 


ORF02227 


SAG2001 


756 


conjugal transfer protein, interruption-C 


ORF02228 


SAG2002 


129 


IS1381, transposase OrfB 


ORF02229 


SAG2003 


127 


IS1381, transposase OrfA


ORF02230 


SAG2005 


136 


conserved hypothetical protein 


ORF02231 


SAG2006 


88 


conserved hypothetical protein 


ORF02232 


SAG2007 


317 


conserved hypothetical protein


ORF02233


SAG2008


84 


conserved hypothetical protein 


ORF02234 


SAG2009


88 


conserved hypothetical protein 


ORF02235 


SAG2010


157 


hypothetical protein 


ORF02236 


SAG2011


160 


conserved hypothetical protein 


ORF02237 


SAG2012


90 


hypothetical protein 


ORF02238


SAG2013


189 


hypothetical protein 


ORF02239 


SAG2014


449 


hypothetical protein 


ORF02240


SAG2015 


99 


transcriptional regulator, Cro/CI family 


ORF02241


SAG2016 


125 


hypothetical protein 


ORF02242 


SAG2017 


429 


transcriptional regulator, Cro/CI family 


ORF02243 


SAG2018 


553 


FtsK/SpoIIIE family protein


ORF02244 


SAG2019 


153 


hypothetical protein 


ORF02245 


SAG2020 


98 


hypothetical protein 


ORF02246 


SAG2021 


826 


cell wall surface anchor family protein 


ORF02247 


SAG2022 


417 


transposase, ISL3 family 


ORF02249 


SAG2023 


546 


mercuric reductase 


ORF02250 


SAG2024 


130 


mercuric resistance operon regulatory protein MerR 


ORF02251 


SAG2025


522 


Mn2+/Fe2+ transporter, NRAMP family


ORF02252 


SAG2026


240 


membrane protein, putative 


ORF02253


SAG2027


205 


ABC transporter, ATP-binding protein 


ORF02254 


SAG2028


36 


conserved hypothetical protein 


ORF02255 


SAG2029 


284 


streptomycin resistance protein 


ORF02257 




a OA 


hypothetical protein 


ORF02258


SAG2031


2U2 


hypothetical protein 


ORF02259 


SAG2032


111 


conserved hypothetical protein 


ORF02260 


SAG2033


162 


acetyltransferase, GNAT family 


ORF02261


SAG2034


Ovl T 

ZAl 


membrane protein, putative


ORF02262




OAA 

oUU 


ABC transporter, ATP-binding protein 


ORF02263 


SAG2036


bo 


hypothetical protein 


ORF02264 


SAG2037 


358 


transcriptional regulator, Cro/CI family 


ORF02265


SAG2038


204 


hahz Tamuy protein 


ORF02266


SAG2039


98 


conserved hypothetical protein 


ORF02267 


SAG2040 


186 


conserved hypothetical protein TIGR00730 


ORF02268 


SAG2041 


287 


protease, putative 


ORF02269 


SAG2042 


100 


rhodanese family protein 


ORF02270 


SAG2043 


255 


cAMP factor 


ORF02271 


SAG2044 


62 


hypothetical protein 


ORF02272 


SAG2045 


179 


DNA topology modulation protein FlaR, putative


ORF02273 


SAG2046 


361 


glycerol dehydrogenase, putative 


ORF02274 


SAG2047 


235 


conserved hypothetical protein 


ORF02275 


SAG2048 


614 


5-methyltetrahydrofolate-homocysteine 
methyltransferase, putative


ORF02276 


SAG2049 


745 


5-methyltetrahydropteroyltriglutamate-homocysteine
methyltransferase 


ORF02277 


SAG2050 


107 


conserved hypothetical protein 


ORF02278 


SAG2051 


230 


branched-chain amino acid transport protein AzlC,
putative 


ORF02279 


SAG2052 


41 


hypothetical protein 


ORF02280 


SAG2053 


1570 


serine protease, subtilase family, putative 


ORF02281 


SAG2054 


228 


DNA-binding response regulator 


ORF02282 


SAG2055 


462 


sensor histidine kinase 


ORF02283 


SAG2056 


202 


chromosome assembly-related protein 


ORF02285 


SAG2057 


833 


leucyl-tRNA synthetase 


ORF02286 


SAG2058 


415 


major facilitator family protein


ORF02287


SAG2059 


281 


conserved hypothetical protein 


ORF02288 


SAG2060 


398 


glycosyl transferase, family 8 


ORF02289 


SAG2061 


401 


glycosyl transferase, family 8 


ORF02290 


SAG2062 


179 


transcription antitermination protein NusG


ORF02291 


SAG2063 


630 


pathogenicity protein, putative 


ORF02292 


SAG2064 


57 


preprotein translocase, SecE subunit, putative 


ORF02293


SAG2066 


773 


penicillin-binding protein 2A 


ORF02294 


SAG2067 


294 


ribosomal large subunit pseudouridine synthase, RluD 
subfamily 


ORF02295 


SAG2068 


546 


Lyme disease proteins of unknown function, putative 


ORF02296 


SAG2069 


403 


phosphopentomutase 


ORF02297 


SAG2070 


223 


deoxyribose-phosphate aldolase 


ORF02298 


SAG2071 


400 


Na+ dependent nucleoside transporter 


ORF02300 


SAG2072 


259 


uridine phosphorylase 


ORF02301 


SAG2073 


245 


transcriptional regulator, GntR family 


ORF02302 


SAG2074 


540 


60 kDa chaperonin


ORF02303 


SAG2075 


94 


chaperonin, 10 kDa 


ORF02305 


SAG2076 


267 


ABC transporter, ATP-binding protein 


ORF02306 


SAG2077 


298 


ABC transporter, permease protein 


ORF02307 


SAG2078 


320 


lipoprotein, putative 


ORF02308 


SAG2079 


265 


hydrolase, haloacid dehalogenase-like family 


ORF02309


SAG2080 


286 


glyoxalase family protein 


ORF02310


SAG2081 


243 


conserved hypothetical protein 


ORF02311


SAG2082 


205 


anaerobic ribonucleoside-triphosphate reductase 
activating protein 


ORF02312 


SAG2083 


163 


acetyltransferase, GNAT family


ORF02313 


SAG2084 


310 


virulence factor MviM, putative 


ORF02314 


SAG2085 


47 


conserved hypothetical protein 


ORF02315 


SAG2086 


723 


anaerobic ribonucleoside-triphosphate reductase 


ORF02316 


SAG2087 


495 


conserved hypothetical protein 


ORF02317 


SAG2088 


40 


hypothetical protein 


ORF02318 


SAG2089 


105 


conserved hypothetical protein 


ORF02319 


SAG2090 


136 


conserved hypothetical protein TIGR00250 


ORF02320 


SAG2091 


88 


conserved hypothetical protein 


ORF02321 


SAG2092 


132 


conserved hypothetical protein 


ORF02322 


SAG2093 


379 


recA protein 


ORF02323 


SAG2094 




competence/damage-inducible protein CinA 
FRAMESHIFT 


ORF02325 


SAG2095 


183 


DNA-3-methyladenine glycosylase I 


ORF02327 


SAG2096 


196 


Holliday junction DNA helicase RuvA 


ORF02328 


SAG2097 


418 


transporter, putative 


ORF02329 


SAG2098 


659 


DNA mismatch repair protein HexB 


ORF02330 


SAG2099 


33 


hypothetical protein 


ORF02331 


SAG2100 


67 


cold shock protein, CSD family 


ORF02332 


SAG2101 


858 


DNA mismatch repair protein HexA 


ORF02333 


SAG2102 


145 


arginine repressor ArgR, putative 


ORF02334 


SAG2103 


563 


arginyl-tRNA synthetase 


ORF02335 


SAG2104 


102 


conserved hypothetical protein 


ORF02337 


SAG2105 


290 


conserved hypothetical protein 


ORF02338 


SAG2106 


314 


conserved hypothetical protein 


ORF02339 


SAG2107 


583 


aspartyl-tRNA synthetase 


ORF02340 


SAG2108 


426 


histidyl-tRNA synthetase 


ORF02341 


SAG2109 


60 


ribosomal protein L32


ORF02342


SAG2110 


49 


ribosomal protein L33 


ORF02343 


SAG2111 


173 


conserved hypothetical protein 


ORF02344 


SAG2112 


494 


site-specific recombinase, phage integrase family 


ORF02345 


SAG2113 


82


conserved hypothetical protein 


ORF02346 


SAG2114 


342 


conserved hypothetical protein 


ORF02347 


SAG2115 


143 


hypothetical protein 


ORF02349 


SAG2116 


151 


conserved hypothetical protein 


ORF02350 


SAG2117 


71 


hypothetical protein 


ORF02351 


SAG2118 


306 


transcriptional regulator, Cro/CI family 


ORF02352 


SAG2119 


373 


conserved domain protein 


ORF02355 


SAG2120 


56 


hypothetical protein 


ORF02356 


SAG2121 


176 


hypothetical protein 


ORF02357 


SAG2122 


223 


DNA-binding response regulator 


ORF02358 


SAG2123 


454 


sensor histidine kinase 


ORF02359 


SAG2124 


517 


membrane protein, putative 


ORF02360 


SAG2125 


308 


carbamate kinase 


ORF02361 


SAG2126 


332 


ornithine carbamoyltransferase 


ORF02362 


SAG2127 


431 


sensor histidine kinase 


ORF02363 


SAG2128 


277 


response regulator 


ORF02364 


SAG2129 


240 


amino acid ABC transporter, ATP-binding protein 


ORF02365 


SAG2130 


504 


amino acid ABC transporter, permease and amino acid
binding protein


ORF02367 


SAG2131 


847 


membrane protein, putative 


ORF02368 


SAG2132 


247 


conserved hypothetical protein 


ORF02369 


SAG2133 


118 


conserved hypothetical protein 


ORF02370 


SAG2134 


772 


membrane protein, putative 


ORF02371 


SAG2135 


179 


transcriptional regulator, TetR family, putative 


ORF02372 


SAG2136 


98 


conserved hypothetical protein 


ORF02373 


SAG2137 


203 


ribosomal protein S4 


ORF02374 


SAG2138 


95 


conserved hypothetical protein 


ORF02375 


SAG2139 


451 


replicative DNA helicase 


ORF02376 


SAG2140 


150 


ribosomal protein L9 


ORF02377 


SAG2141 


660 


DHH family protein 


ORF02378 


SAG2142 


613 


glucose inhibited division protein A 


ORF02379 


SAG2143 


203 


conserved hypothetical protein TIGR00427


ORF02380 


SAG2144 


373 


tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase


ORF02381 


SAG2145 


222 


L-serine dehydratase, iron-sulfur-dependent, beta
subunit


ORF02382 


SAG2146 


290 


L-serine dehydratase, iron-sulfur-dependent, alpha
subunit


ORF02383 


SAG2147 


234 


conserved hypothetical protein 


ORF02384 


SAG2148 


179 


LysM domain protein 


ORF02385 


SAG2149 


264 


cobalt transport family protein 


ORF02386 


SAG2150 


280 


ABC transporter, ATP-binding protein


ORF02387 


SAG2151 


279 


ABC transporter, ATP-binding protein, FRAMESHIFT 


ORF02388 


SAG2152 


180 


CDP-diacylglycerol-glycerol-3-phosphate 3-phosphatidyltransferase


ORF02389 


SAG2153 


427 


peptidase, M16 family


ORF02390 


SAG2154 


414 


conserved hypothetical protein 


ORF02391 


SAG2155 


117 


conserved hypothetical protein 


ORF02392 


SAG2156 


369 


recF protein 


ORF02393 


SAG2157 


278 


transporter, putative 


ORF02395 


SAG2158 


220 


transcriptional regulator, Cro/CI family


ORF02396




A OO 

4yo 


inosine-5-monophosphate dehydrogenase 




SAG2160


Hoi 


transcriptional regulator, ArgR family 




SAG2161




transcriptional regulator, Crp/Fnr family






Zo4 


conserved hypothetical protein




SAG2163


n 

4 1 u 


arginine deiminase




SAG2164


i oc 
1 00 


acetyltransferase, GNAT family


ORF02402


SAG2165


007 
00/ 


ornithine carbamoyltransferase 




SAG2166


/17c 
4/ O 


arginine/ornithine antiporter 


ORF02404


SAG2167


olo 


carbamate kinase 


ORF02405


SAG2168


! O/t «l 

o41 


tryptophanyl-tRNA synthetase 


ORF02406


SAG2169




conserved hypothetical protein 


ORF02407 


SAG2170 


290 


conserved hypothetical protein 


ORF02408 


SAG2171 


539 


ABC transporter, ATP-binding protein 


ORF02409 


SAG2172 


859 


ABC transporter, permease protein, putative 


ORF02410 


SAG2173 


159 


conserved hypothetical protein TIGR00246


ORF02411


SAG2174 


409 


serine protease 


ORF02412 


SAG2175 


257 


partitioning protein, ParB family 


ORF02413


SAG0001 


453 


chromosomal replication initiator protein DnaA 


ORF02415 


SAG0002 


378 


DNA polymerase III, beta subunit


ORF02416 


SAG0003 


293 


diacylglycerol kinase catalytic domain protein, putative 


ORF02417


SAG0004


DO 


conserved hypothetical protein 


ORF02418 


SAG0005 


67 


hypothetical protein 


ORF02419 


SAG0006 


371 


conserved hypothetical GTP-binding protein 


ORF02420 


SAG0007 


191 


peptidyl-tRNA hydrolase 


ORF02421


SAG0008 


1165 


transcription-repair coupling factor 


ORF02422 


SAG0009 


31 


hypothetical protein 


ORF02423


SAG0010 


90 


S4 domain protein 


ORF02424 


SAG0011


123 


cell division protein DivIC, putative


ORF02425 


SAG0012


44 


conserved hypothetical protein 


ORF02426 


SAG0013


428 


conserved hypothetical protein 


ORF02427 


SAG0014 


424 


MesJ/Ycf62 family protein 


ORF02428 


SAG0015


180 


hypoxanthine-guanine phosphoribosyltransferase


ORF02429


SAG0016


DOO 


cell division protein FtsH 


ORF03000


oAoUlOf 




unase-reiated protein, uttjtNtKAl b 


ORF03001


O A /*"*rkK*7Q 

oAtsUO/y 


142 


conserved hypothetical protein 


vJKrUoUUU 


bAbUOOU 


1 -i 1 
1 1 1 


conserved hypothetical protein, truncation 


vJKruoUUo 


oAbUOO^ 




1 nozoz, Kjn Zo protein, aegenerate 


ORF03004


CAPnRW 
OAvjUDOO 


Of 


conserved hypothetical protein 


UKrUoUUO 


QA^ORRO 
OAbUDD^ 




cylX protein 


UKrUouuo 


QA^HQ-I "7 


«o 


Tn916, hypothetical protein 


r^DPnonnv 


oAbUo^U 


Zo 


Tn916, hypothetical protein 


^Dcnonno 




Dl 


Tn916, hypothetical protein


ORF03009 


SAG0924 


28 


Tn916, tetM leader peptide 


ORF03010 


SAG0936 


39 


Tn916, hypothetical protein 


ORF03011
gi|13621326|gb|AAK33146.1|
gi|13621327|gb|AAK33147.1|
gi|13621328|gb|AAK33148.1|
gi|13621329|gb|AAK33149.1|
gi|13621330|gb|AAK33150.1|
gi|13621331|gb|AAK33151.1|
gi|13621332|gb|AAK33152.1|
gi|13621333|gb|AAK33153.1|
gi|13621334|gb|AAK33154.1|
gi|13621335|gb|AAK33155.1|
gi|13621337|gb|AAK33156.1|
gi|13621340|gb|AAK33158.1|
gi|13621341|gb|AAK33159.1|
gi|13621343|gb|AAK33160.1|
gi|13621344|gb|AAK33161.1|
gi|13621346|gb|AAK33163.1|
gi|13621347|gb|AAK33164.1|
gi|13621348|gb|AAK33165.1|
gi|13621349|gb|AAK33166.1|
gi|13621350|gb|AAK33167.1|
gi|13621353|gb|AAK33169.1|
gi|13621354|gb|AAK33170.1|
gi|13621355|gb|AAK33171.1|
gi|13621357|gb|AAK33173.1|
gi|13621358|gb|AAK33174.1|
gi|13621359|gb|AAK33175.1|
gi|13621361|gb|AAK33176.1|
gi|13621362|gb|AAK33177.1|
gi|13621363|gb|AAK33178.1|
gi|13621364|gb|AAK33179.1|
gi|13621365|gb|AAK33180.1|
gi|13621366|gb|AAK33181.1|
gi|13621367|gb|AAK33182.1|
gi|13621368|gb|AAK33183.1|
gi|13621369|gb|AAK33184.1|
gi|13621370|gb|AAK33185.1|
gi|13621372|gb|AAK33186.1|
gi|13621373|gb|AAK33187.1|
gi|13621374|gb|AAK33188.1|
gi|13621375|gb|AAK33189.1|
gi|13621376|gb|AAK33190.1|
gi|13621377|gb|AAK33191.1|
gi|13621378|gb|AAK33192.1|
gi|13621379|gb|AAK33193.1|
gi|13621380|gb|AAK33194.1|
gi|13621382|gb|AAK33196.1|
gi|13621383|gb|AAK33197.1|
gi|13621384|gb|AAK33198.1|
gi|13621385|gb|AAK33199.1|
gi|13621386|gb|AAK33200.1|
gi|13621387|gb|AAK33201.1|
gi|13621388|gb|AAK33202.1|
gi|13621389|gb|AAK33203.1|
gi|13621390|gb|AAK33204.1|
gi|13621391|gb|AAK33205.1|
gi|13621392|gb|AAK33206.1|



gi|13621393|gb|AAK33207.1|
gi|13621394|gb|AAK33208.1|
gi|13621397|gb|AAK33210.1|
gi|13621398|gb|AAK33211.1|
gi|13621399|gb|AAK33212.1|
gi|13621401|gb|AAK33214.1|
gi|13621403|gb|AAK33215.1|
gi|13621404|gb|AAK33216.1|
gi|13621405|gb|AAK33217.1|
gi|13621407|gb|AAK33218.1|
gi|13621408|gb|AAK33219.1|
gi|13621409|gb|AAK33220.1|
gi|13621413|gb|AAK33224.1|
gi|13621415|gb|AAK33226.1|
gi|13621416|gb|AAK33227.1|
gi|13621418|gb|AAK33229.1|
gi|13621419|gb|AAK33230.1|
gi|13621424|gb|AAK33234.1|
gi|13621425|gb|AAK33235.1|
gi|13621426|gb|AAK33236.1|
gi|13621434|gb|AAK33243.1|
gi|13621450|gb|AAK33258.1|
gi|13621455|gb|AAK33262.1|
gi|13621456|gb|AAK33263.1|
gi|13621457|gb|AAK33264.1|
gi|13621467|gb|AAK33273.1|
gi|13621468|gb|AAK33274.1|
gi|13621469|gb|AAK33275.1|
gi|13621470|gb|AAK33276.1|
gi|13621471|gb|AAK33277.1|
gi|13621472|gb|AAK33278.1|
gi|13621473|gb|AAK33279.1|
gi|13621476|gb|AAK33281.1|
gi|13621477|gb|AAK33282.1|
gi|13621478|gb|AAK33283.1|
gi|13621480|gb|AAK33285.1|
gi|13621481|gb|AAK33286.1|
gi|13621491|gb|AAK33295.1|
gi|13621494|gb|AAK33298.1|
gi|13621496|gb|AAK33299.1|
gi|13621501|gb|AAK33304.1|
gi|13621502|gb|AAK33305.1|
gi|13621505|gb|AAK33307.1|
gi|13621506|gb|AAK33308.1|
gi|13621507|gb|AAK33309.1|
gi|13621510|gb|AAK33312.1|
gi|13621511|gb|AAK33313.1|
gi|13621513|gb|AAK33315.1|
gi|13621516|gb|AAK33317.1|
gi|13621518|gb|AAK33319.1|
gi|13621521|gb|AAK33322.1|
gi|13621522|gb|AAK33323.1|
gi|13621523|gb|AAK33324.1|
gi|13621524|gb|AAK33325.1|
gi|13621525|gb|AAK33326.1|
gi|13621527|gb|AAK33327.1|
gi|13621528|gb|AAK33328.1|
gi|13621529|gb|AAK33329.1|
gi|13621530|gb|AAK33330.1|
gi|13621531|gb|AAK33331.1|
gi|13621532|gb|AAK33332.1|
gi|13621533|gb|AAK33333.1|
gi|13621534|gb|AAK33334.1|
gi|13621535|gb|AAK33335.1|
gi|13621536|gb|AAK33336.1|
gi|13621537|gb|AAK33337.1|
gi|13621539|gb|AAK33338.1|
gi|13621540|gb|AAK33339.1|
gi|13621541|gb|AAK33340.1|
gi|13621542|gb|AAK33341.1|
gi|13621543|gb|AAK33342.1|
gi|13621544|gb|AAK33343.1|
gi|13621546|gb|AAK33345.1|
gi|13621547|gb|AAK33346.1|
gi|13621548|gb|AAK33347.1|
gi|13621550|gb|AAK33348.1|
gi|13621551|gb|AAK33349.1|
gi|13621552|gb|AAK33350.1|
gi|13621553|gb|AAK33351.1|
gi|13621554|gb|AAK33352.1|
gi|13621555|gb|AAK33353.1|
gi|13621557|gb|AAK33355.1|
gi|13621559|gb|AAK33356.1|
gi|13621560|gb|AAK33357.1|
gi|13621561|gb|AAK33358.1|
gi|13621562|gb|AAK33359.1|
gi|13621563|gb|AAK33360.1|
gi|13621564|gb|AAK33361.1|
gi|13621565|gb|AAK33362.1|
gi|13621566|gb|AAK33363.1|
gi|13621567|gb|AAK33364.1|
gi|13621569|gb|AAK33365.1|
gi|13621571|gb|AAK33367.1|
gi|13621572|gb|AAK33368.1|
gi|13621573|gb|AAK33369.1|
gi|13621574|gb|AAK33370.1|
gi|13621575|gb|AAK33371.1|
gi|13621576|gb|AAK33372.1|
gi|13621577|gb|AAK33373.1|
gi|13621579|gb|AAK33374.1|
gi|13621581|gb|AAK33376.1|
gi|13621582|gb|AAK33377.1|
gi|13621583|gb|AAK33378.1|
gi|13621584|gb|AAK33379.1|
gi|13621585|gb|AAK33380.1|
gi|13621586|gb|AAK33381.1|
gi|13621588|gb|AAK33383.1|
gi|13621589|gb|AAK33384.1|
gi|13621590|gb|AAK33385.1|
gi|13621592|gb|AAK33386.1|
gi|13621593|gb|AAK33387.1|
gi|13621594|gb|AAK33388.1|



gi|13621595|gb|AAK33389.1|
gi|13621596|gb|AAK33390.1|
gi|13621597|gb|AAK33391.1|
gi|13621598|gb|AAK33392.1|
gi|13621599|gb|AAK33393.1|
gi|13621600|gb|AAK33394.1|
gi|13621602|gb|AAK33395.1|
gi|13621603|gb|AAK33396.1|
gi|13621604|gb|AAK33397.1|
gi|13621605|gb|AAK33398.1|
gi|13621606|gb|AAK33399.1|
gi|13621607|gb|AAK33400.1|
gi|13621608|gb|AAK33401.1|
gi|13621609|gb|AAK33402.1|
gi|13621611|gb|AAK33404.1|
gi|13621614|gb|AAK33406.1|
gi|13621615|gb|AAK33407.1|
gi|13621616|gb|AAK33408.1|
gi|13621617|gb|AAK33409.1|
gi|13621618|gb|AAK33410.1|
gi|13621619|gb|AAK33411.1|
gi|13621620|gb|AAK33412.1|
gi|13621621|gb|AAK33413.1|
gi|13621622|gb|AAK33414.1|
gi|13621623|gb|AAK33415.1|
gi|13621624|gb|AAK33416.1|
gi|13621625|gb|AAK33417.1|
gi|13621627|gb|AAK33419.1|
gi|13621629|gb|AAK33420.1|
gi|13621630|gb|AAK33421.1|
gi|13621631|gb|AAK33422.1|
gi|13621633|gb|AAK33424.1|
gi|13621634|gb|AAK33425.1|
gi|13621636|gb|AAK33427.1|
gi|13621637|gb|AAK33428.1|
gi|13621638|gb|AAK33429.1|
gi|13621640|gb|AAK33430.1|
gi|13621642|gb|AAK33432.1|
gi|13621644|gb|AAK33434.1|
gi|13621645|gb|AAK33435.1|
gi|13621647|gb|AAK33437.1|
gi|13621648|gb|AAK33438.1|
gi|13621650|gb|AAK33440.1|
gi|13621651|gb|AAK33441.1|
gi|13621652|gb|AAK33442.1|
gi|13621657|gb|AAK33446.1|
gi|13621658|gb|AAK33447.1|
gi|13621660|gb|AAK33449.1|
gi|13621670|gb|AAK33458.1|
gi|13621671|gb|AAK33459.1|
gi|13621672|gb|AAK33460.1|
gi|13621675|gb|AAK33462.1|
gi|13621676|gb|AAK33463.1|
gi|13621678|gb|AAK33465.1|
gi|13621680|gb|AAK33467.1|
gi|13621681|gb|AAK33468.1|
gi|13621682|gb|AAK33469.1|
gi|13621683|gb|AAK33470.1|
gi|13621684|gb|AAK33471.1|
gi|13621685|gb|AAK33472.1|
gi|13621688|gb|AAK33474.1|
gi|13621689|gb|AAK33475.1|
gi|13621690|gb|AAK33476.1|
gi|13621691|gb|AAK33477.1|
gi|13621692|gb|AAK33478.1|
gi|13621693|gb|AAK33479.1|
gi|13621694|gb|AAK33480.1|
gi|13621695|gb|AAK33481.1|
gi|13621697|gb|AAK33483.1|
gi|13621698|gb|AAK33484.1|
gi|13621700|gb|AAK33485.1|
gi|13621701|gb|AAK33486.1|
gi|13621702|gb|AAK33487.1|
gi|13621714|gb|AAK33498.1|
gi|13621715|gb|AAK33499.1|
gi|13621717|gb|AAK33501.1|
gi|13621718|gb|AAK33502.1|
gi|13621719|gb|AAK33503.1|
gi|13621720|gb|AAK33504.1|
gi|13621726|gb|AAK33509.1|
gi|13621727|gb|AAK33510.1|
gi|13621729|gb|AAK33512.1|
gi|13621730|gb|AAK33513.1|
gi|13621731|gb|AAK33514.1|
gi|13621732|gb|AAK33515.1|
gi|13621733|gb|AAK33516.1|
gi|13621734|gb|AAK33517.1|
gi|13621735|gb|AAK33518.1|
gi|13621736|gb|AAK33519.1|
gi|13621741|gb|AAK33523.1|
gi|13621742|gb|AAK33524.1|
gi|13621743|gb|AAK33525.1|
gi|13621744|gb|AAK33526.1|
gi|13621745|gb|AAK33527.1|
gi|13621747|gb|AAK33528.1|
gi|13621756|gb|AAK33537.1|
gi|13621773|gb|AAK33552.1|
gi|13621774|gb|AAK33553.1|
gi|13621775|gb|AAK33554.1|
gi|13621777|gb|AAK33556.1|
gi|13621778|gb|AAK33557.1|
gi|13621779|gb|AAK33558.1|
gi|13621781|gb|AAK33559.1|
gi|13621782|gb|AAK33560.1|
gi|13621785|gb|AAK33563.1|
gi|13621786|gb|AAK33564.1|
gi|13621787|gb|AAK33565.1|
gi|13621788|gb|AAK33566.1|
gi|13621789|gb|AAK33567.1|
gi|13621790|gb|AAK33568.1|
gi|13621793|gb|AAK33571.1|
gi|13621794|gb|AAK33572.1|



gi|13621796|gb|AAK33573.1|
gi|13621797|gb|AAK33574.1|
gi|13621799|gb|AAK33576.1|
gi|13621800|gb|AAK33577.1|
gi|13621802|gb|AAK33579.1|
gi|13621806|gb|AAK33583.1|
gi|13621808|gb|AAK33584.1|
gi|13621809|gb|AAK33585.1|
gi|13621810|gb|AAK33586.1|
gi|13621811|gb|AAK33587.1|
gi|13621812|gb|AAK33588.1|
gi|13621813|gb|AAK33589.1|
gi|13621814|gb|AAK33590.1|
gi|13621817|gb|AAK33592.1|
gi|13621818|gb|AAK33593.1|
gi|13621819|gb|AAK33594.1|
gi|13621820|gb|AAK33595.1|
gi|13621821|gb|AAK33596.1|
gi|13621822|gb|AAK33597.1|
gi|13621823|gb|AAK33598.1|
gi|13621824|gb|AAK33599.1|
gi|13621825|gb|AAK33600.1|
gi|13621826|gb|AAK33601.1|
gi|13621828|gb|AAK33602.1|
gi|13621829|gb|AAK33603.1|
gi|13621830|gb|AAK33604.1|
gi|13621831|gb|AAK33605.1|
gi|13621834|gb|AAK33608.1|
gi|13621835|gb|AAK33609.1|
gi|13621836|gb|AAK33610.1|
gi|13621837|gb|AAK33611.1|
gi|13621839|gb|AAK33612.1|
gi|13621840|gb|AAK33613.1|
gi|13621841|gb|AAK33614.1|
gi|13621842|gb|AAK33615.1|
gi|13621843|gb|AAK33616.1|
gi|13621844|gb|AAK33617.1|
gi|13621898|gb|AAK33667.1|
gi|13621901|gb|AAK33670.1|
gi|13621902|gb|AAK33671.1|
gi|13621904|gb|AAK33672.1|
gi|13621907|gb|AAK33675.1|
gi|13621908|gb|AAK33676.1|
gi|13621909|gb|AAK33677.1|
gi|13621910|gb|AAK33678.1|
gi|13621912|gb|AAK33680.1|
gi|13621924|gb|AAK33690.1|
gi|13621929|gb|AAK33694.1|
gi|13621930|gb|AAK33695.1|
gi|13621931|gb|AAK33696.1|
gi|13621933|gb|AAK33698.1|
gi|13621934|gb|AAK33699.1|
gi|13621936|gb|AAK33700.1|
gi|13621936|gb|AAK33701.1|
gi|13621937|gb|AAK33702.1|
gi|13621938|gb|AAK33703.1|
gi|13621939|gb|AAK33704.1|
gi|13621942|gb|AAK33706.1|
gi|13621944|gb|AAK33708.1|
gi|13621945|gb|AAK33709.1|
gi|13621946|gb|AAK33710.1|
gi|13621950|gb|AAK33714.1|
gi|13621953|gb|AAK33716.1|
gi|13621954|gb|AAK33717.1|
gi|13621955|gb|AAK33718.1|
gi|13621956|gb|AAK33719.1|
gi|13621957|gb|AAK33720.1|
gi|13621958|gb|AAK33721.1|
gi|13621959|gb|AAK33722.1|
gi|13621961|gb|AAK33723.1|
gi|13621975|gb|AAK33736.1|
gi|13621977|gb|AAK33738.1|
gi|13621978|gb|AAK33739.1|
gi|13621979|gb|AAK33740.1|
gi|13621980|gb|AAK33741.1|
gi|13621981|gb|AAK33742.1|
gi|13621982|gb|AAK33743.1|
gi|13621985|gb|AAK33745.1|
gi|13621986|gb|AAK33746.1|
gi|13621987|gb|AAK33747.1|
gi|13621989|gb|AAK33749.1|
gi|13621990|gb|AAK33750.1|
gi|13621992|gb|AAK33752.1|
gi|13621993|gb|AAK33753.1|
gi|13621994|gb|AAK33754.1|
gi|13621996|gb|AAK33755.1|
gi|13621997|gb|AAK33756.1|
gi|13621998|gb|AAK33757.1|
gi|13621999|gb|AAK33758.1|
gi|13622000|gb|AAK33759.1|
gi|13622001|gb|AAK33760.1|
gi|13622002|gb|AAK33761.1|
gi|13622003|gb|AAK33762.1|
gi|13622004|gb|AAK33763.1|
gi|13622005|gb|AAK33764.1|
gi|13622006|gb|AAK33765.1|
gi|13622008|gb|AAK33766.1|
gi|13622009|gb|AAK33767.1|
gi|13622010|gb|AAK33768.1|
gi|13622012|gb|AAK33770.1|
gi|13622013|gb|AAK33771.1|
gi|13622017|gb|AAK33774.1|
gi|13622018|gb|AAK33775.1|
gi|13622019|gb|AAK33776.1|
gi|13622020|gb|AAK33777.1|
gi|13622021|gb|AAK33778.1|
gi|13622024|gb|AAK33781.1|
gi|13622025|gb|AAK33782.1|
gi|13622026|gb|AAK33783.1|
gi|13622031|gb|AAK33787.1|
gi|13622032|gb|AAK33788.1|
gi|13622033|gb|AAK33789.1|



gi|13622034|gb|AAK33790.1|
gi|13622035|gb|AAK33791.1|
gi|13622039|gb|AAK33794.1|
gi|13622041|gb|AAK33796.1|
gi|13622042|gb|AAK33797.1|
gi|13622043|gb|AAK33798.1|
gi|13622044|gb|AAK33799.1|
gi|13622045|gb|AAK33800.1|
gi|13622046|gb|AAK33801.1|
gi|13622048|gb|AAK33802.1|
gi|13622049|gb|AAK33803.1|
gi|13622050|gb|AAK33804.1|
gi|13622051|gb|AAK33805.1|
gi|13622052|gb|AAK33806.1|
gi|13622054|gb|AAK33808.1|
gi|13622055|gb|AAK33809.1|
gi|13622056|gb|AAK33810.1|
gi|13622058|gb|AAK33812.1|
gi|13622060|gb|AAK33813.1|
gi|13622062|gb|AAK33815.1|
gi|13622064|gb|AAK33817.1|
gi|13622065|gb|AAK33818.1|
gi|13622068|gb|AAK33821.1|
gi|13622069|gb|AAK33822.1|
gi|13622070|gb|AAK33823.1|
gi|13622071|gb|AAK33824.1|
gi|13622073|gb|AAK33825.1|
gi|13622074|gb|AAK33826.1|
gi|13622075|gb|AAK33827.1|
gi|13622077|gb|AAK33829.1|
gi|13622079|gb|AAK33831.1|
gi|13622083|gb|AAK33834.1|
gi|13622085|gb|AAK33836.1|
gi|13622086|gb|AAK33837.1|
gi|13622087|gb|AAK33838.1|
gi|13622088|gb|AAK33839.1|
gi|13622089|gb|AAK33840.1|
gi|13622090|gb|AAK33841.1|
gi|13622091|gb|AAK33842.1|
gi|13622092|gb|AAK33843.1|
gi|13622093|gb|AAK33844.1|
gi|13622095|gb|AAK33845.1|
gi|13622096|gb|AAK33846.1|
gi|13622097|gb|AAK33847.1|
gi|13622162|gb|AAK33908.1|
gi|13622163|gb|AAK33909.1|
gi|13622164|gb|AAK33910.1|
gi|13622165|gb|AAK33911.1|
gi|13622166|gb|AAK33912.1|
gi|13622169|gb|AAK33914.1|
gi|13622170|gb|AAK33915.1|
gi|13622171|gb|AAK33916.1|
gi|13622172|gb|AAK33917.1|
gi|13622174|gb|AAK33919.1|
gi|13622175|gb|AAK33920.1|
gi|13622176|gb|AAK33921.1|
gi|13622177|gb|AAK33922.1|
gi|13622179|gb|AAK33923.1|
gi|13622180|gb|AAK33924.1|
gi|13622181|gb|AAK33925.1|
gi|13622182|gb|AAK33926.1|
gi|13622183|gb|AAK33927.1|
gi|13622184|gb|AAK33928.1|
gi|13622185|gb|AAK33929.1|
gi|13622186|gb|AAK33930.1|
gi|13622189|gb|AAK33932.1|
gi|13622190|gb|AAK33933.1|
gi|13622191|gb|AAK33934.1|
gi|13622192|gb|AAK33935.1|
gi|13622198|gb|AAK33940.1|
gi|13622200|gb|AAK33942.1|
gi|13622201|gb|AAK33943.1|
gi|13622204|gb|AAK33946.1|
gi|13622205|gb|AAK33947.1|
gi|13622207|gb|AAK33949.1|
gi|13622208|gb|AAK33950.1|
gi|13622211|gb|AAK33952.1|
gi|13622213|gb|AAK33954.1|
gi|13622214|gb|AAK33955.1|
gi|13622215|gb|AAK33956.1|
gi|13622216|gb|AAK33957.1|
gi|13622217|gb|AAK33958.1|
gi|13622218|gb|AAK33959.1|
gi|13622219|gb|AAK33960.1|
gi|13622222|gb|AAK33962.1|
gi|13622223|gb|AAK33963.1|
gi|13622224|gb|AAK33964.1|
gi|13622233|gb|AAK33972.1|
gi|13622235|gb|AAK33974.1|
gi|13622236|gb|AAK33975.1|
gi|13622237|gb|AAK33976.1|
gi|13622239|gb|AAK33978.1|
gi|13622240|gb|AAK33979.1|
gi|13622241|gb|AAK33980.1|
gi|13622242|gb|AAK33981.1|
gi|13622243|gb|AAK33982.1|
gi|13622244|gb|AAK33983.1|
gi|13622250|gb|AAK33988.1|
gi|13622252|gb|AAK33990.1|
gi|13622253|gb|AAK33991.1|
gi|13622255|gb|AAK33993.1|
gi|13622256|gb|AAK33994.1|
gi|13622257|gb|AAK33995.1|
gi|13622259|gb|AAK33996.1|
gi|13622260|gb|AAK33997.1|
gi|13622261|gb|AAK33998.1|
gi|13622262|gb|AAK33999.1|
gi|13622263|gb|AAK34000.1|
gi|13622264|gb|AAK34001.1|
gi|13622265|gb|AAK34002.1|
gi|13622266|gb|AAK34003.1|
gi|13622268|gb|AAK34005.1|



gi|13622269|gb|AAK34006.1|
gi|13622271|gb|AAK34007.1|
gi|13622272|gb|AAK34008.1|
gi|13622273|gb|AAK34009.1|
gi|13622274|gb|AAK34010.1|
gi|13622275|gb|AAK34011.1|
gi|13622276|gb|AAK34012.1|
gi|13622277|gb|AAK34013.1|
gi|13622278|gb|AAK34014.1|
gi|13622279|gb|AAK34015.1|
gi|13622281|gb|AAK34017.1|
gi|13622282|gb|AAK34018.1|
gi|13622283|gb|AAK34019.1|
gi|13622284|gb|AAK34020.1|
gi|13622285|gb|AAK34021.1|
gi|13622287|gb|AAK34022.1|
gi|13622288|gb|AAK34023.1|
gi|13622289|gb|AAK34024.1|
gi|13622290|gb|AAK34025.1|
gi|13622294|gb|AAK34029.1|
gi|13622295|gb|AAK34030.1|
gi|13622296|gb|AAK34031.1|
gi|13622297|gb|AAK34032.1|
gi|13622298|gb|AAK34033.1|
gi|13622299|gb|AAK34034.1|
gi|13622301|gb|AAK34035.1|
gi|13622306|gb|AAK34040.1|
gi|13622326|gb|AAK34058.1|
gi|13622328|gb|AAK34060.1|
gi|13622329|gb|AAK34061.1|
gi|13622330|gb|AAK34062.1|
gi|13622332|gb|AAK34064.1|
gi|13622333|gb|AAK34065.1|
gi|13622335|gb|AAK34066.1|
gi|13622338|gb|AAK34069.1|
gi|13622339|gb|AAK34070.1|
gi|13622340|gb|AAK34071.1|
gi|13622341|gb|AAK34072.1|
gi|13622343|gb|AAK34073.1|
gi|13622350|gb|AAK34080.1|
gi|13622351|gb|AAK34081.1|
gi|13622352|gb|AAK34082.1|
gi|13622353|gb|AAK34083.1|
gi|13622355|gb|AAK34084.1|
gi|13622356|gb|AAK34085.1|
gi|13622357|gb|AAK34086.1|
gi|13622358|gb|AAK34087.1|
gi|13622359|gb|AAK34088.1|
gi|13622360|gb|AAK34089.1|
gi|13622361|gb|AAK34090.1|
gi|13622362|gb|AAK34091.1|
gi|13622363|gb|AAK34092.1|
gi|13622364|gb|AAK34093.1|
gi|13622366|gb|AAK34094.1|
gi|13622367|gb|AAK34095.1|
gi|13622368|gb|AAK34096.1|
gi|13622369|gb|AAK34097.1|
gi|13622370|gb|AAK34098.1|
gi|13622371|gb|AAK34099.1|
gi|13622372|gb|AAK34100.1|
gi|13622373|gb|AAK34101.1|
gi|13622374|gb|AAK34102.1|
gi|13622375|gb|AAK34103.1|
gi|13622376|gb|AAK34104.1|
gi|13622377|gb|AAK34105.1|
gi|13622378|gb|AAK34106.1|
gi|13622380|gb|AAK34107.1|
gi|13622383|gb|AAK34110.1|
gi|13622384|gb|AAK34111.1|
gi|13622387|gb|AAK34114.1|
gi|13622389|gb|AAK34116.1|
gi|13622394|gb|AAK34120.1|
gi|13622395|gb|AAK34121.1|
gi|13622396|gb|AAK34122.1|
gi|13622398|gb|AAK34124.1|
gi|13622399|gb|AAK34125.1|
gi|13622400|gb|AAK34126.1|
gi|13622401|gb|AAK34127.1|
gi|13622403|gb|AAK34128.1|
gi|13622405|gb|AAK34130.1|
gi|13622406|gb|AAK34131.1|
gi|13622407|gb|AAK34132.1|
gi|13622408|gb|AAK34133.1|
gi|13622415|gb|AAK34139.1|
gi|13622416|gb|AAK34140.1|
gi|13622417|gb|AAK34141.1|
gi|13622419|gb|AAK34143.1|
gi|13622420|gb|AAK34144.1|
gi|13622424|gb|AAK34147.1|
gi|13622425|gb|AAK34148.1|
gi|13622431|gb|AAK34153.1|
gi|13622432|gb|AAK34154.1|
gi|13622433|gb|AAK34155.1|
gi|13622434|gb|AAK34156.1|
gi|13622435|gb|AAK34157.1|
gi|13622436|gb|AAK34158.1|
gi|13622437|gb|AAK34159.1|
gi|13622444|gb|AAK34165.1|
gi|13622447|gb|AAK34168.1|
gi|13622450|gb|AAK34170.1|
gi|13622451|gb|AAK34171.1|
gi|13622455|gb|AAK34175.1|
gi|13622457|gb|AAK34177.1|
gi|13622458|gb|AAK34178.1|
gi|13622460|gb|AAK34179.1|
gi|13622461|gb|AAK34180.1|
gi|13622462|gb|AAK34181.1|
gi|13622463|gb|AAK34182.1|
gi|13622464|gb|AAK34183.1|
gi|13622465|gb|AAK34184.1|
gi|13622467|gb|AAK34186.1|
gi|13622468|gb|AAK34187.1|



gi|13622471|gb|AAK34189.1|
gi|13622473|gb|AAK34191.1|
gi|13622474|gb|AAK34192.1|
gi|13622477|gb|AAK34195.1|
gi|13622478|gb|AAK34196.1|
gi|13622479|gb|AAK34197.1|
gi|13622481|gb|AAK34198.1|
gi|13622482|gb|AAK34199.1|
gi|13622483|gb|AAK34200.1|
gi|13622484|gb|AAK34201.1|
gi|13622485|gb|AAK34202.1|
gi|13622486|gb|AAK34203.1|
gi|13622491|gb|AAK34207.1|
gi|13622492|gb|AAK34208.1|
gi|13622493|gb|AAK34209.1|
gi|13622494|gb|AAK34210.1|
gi|13622495|gb|AAK34211.1|
gi|13622496|gb|AAK34212.1|
gi|13622497|gb|AAK34213.1|
gi|13622499|gb|AAK34214.1|
gi|13622500|gb|AAK34215.1|
gi|13622501|gb|AAK34216.1|
gi|13622506|gb|AAK34221.1|
gi|13622507|gb|AAK34222.1|
gi|13622508|gb|AAK34223.1|
gi|13622509|gb|AAK34224.1|
gi|13622511|gb|AAK34225.1|
gi|13622512|gb|AAK34226.1|
gi|13622513|gb|AAK34227.1|
gi|13622515|gb|AAK34229.1|
gi|13622516|gb|AAK34230.1|
gi|13622517|gb|AAK34231.1|
gi|13622518|gb|AAK34232.1|
gi|13622520|gb|AAK34233.1|
gi|13622521|gb|AAK34234.1|
gi|13622523|gb|AAK34236.1|
gi|13622524|gb|AAK34237.1|
gi|13622525|gb|AAK34238.1|
gi|13622526|gb|AAK34239.1|
gi|13622527|gb|AAK34240.1|
gi|13622579|gb|AAK34289.1|
gi|13622583|gb|AAK34292.1|
gi|13622585|gb|AAK34294.1|
gi|13622587|gb|AAK34296.1|
gi|13622588|gb|AAK34297.1|
gi|13622590|gb|AAK34299.1|
gi|13622591|gb|AAK34300.1|
gi|13622593|gb|AAK34301.1|
gi|13622595|gb|AAK34303.1|
gi|13622596|gb|AAK34304.1|
gi|13622597|gb|AAK34305.1|
gi|13622598|gb|AAK34306.1|
gi|13622599|gb|AAK34307.1|
gi|13622600|gb|AAK34308.1|
gi|13622601|gb|AAK34309.1|
gi|13622603|gb|AAK34310.1|
gi|13622604|gb|AAK34311.1|
gi|13622606|gb|AAK34313.1|
gi|13622607|gb|AAK34314.1|
gi|13622608|gb|AAK34315.1|
gi|13622609|gb|AAK34316.1|
gi|13622610|gb|AAK34317.1|
gi|13622611|gb|AAK34318.1|
gi|13622612|gb|AAK34319.1|
gi|13622615|gb|AAK34321.1|
gi|13622616|gb|AAK34322.1|
gi|13622617|gb|AAK34323.1|
gi|13622618|gb|AAK34324.1|
gi|13622621|gb|AAK34327.1|
gi|13622622|gb|AAK34328.1|
gi|13622623|gb|AAK34329.1|
gi|13622624|gb|AAK34330.1|
gi|13622625|gb|AAK34331.1|
gi|13622626|gb|AAK34332.1|
gi|13622628|gb|AAK34333.1|
gi|13622629|gb|AAK34334.1|
gi|13622630|gb|AAK34335.1|
gi|13622631|gb|AAK34336.1|
gi|13622632|gb|AAK34337.1|
gi|13622634|gb|AAK34339.1|
gi|13622636|gb|AAK34341.1|
gi|13622640|gb|AAK34344.1|
gi|13622641|gb|AAK34346.1|
gi|13622652|gb|AAK34355.1|
gi|13622653|gb|AAK34356.1|
gi|13622654|gb|AAK34357.1|
gi|13622656|gb|AAK34359.1|
gi|13622660|gb|AAK34363.1|
gi|13622665|gb|AAK34367.1|
gi|13622668|gb|AAK34370.1|
gi|13622675|gb|AAK34376.1|
gi|13622676|gb|AAK34377.1|
gi|13622683|gb|AAK34383.1|
gi|13622684|gb|AAK34384.1|
gi|13622685|gb|AAK34385.1|
gi|13622688|gb|AAK34387.1|
gi|13622689|gb|AAK34388.1|
gi|13622690|gb|AAK34389.1|
gi|13622691|gb|AAK34390.1|
gi|13622692|gb|AAK34391.1|
gi|13622693|gb|AAK34392.1|
gi|13622694|gb|AAK34393.1|
gi|13622695|gb|AAK34394.1|
gi|13622696|gb|AAK34395.1|
gi|13622698|gb|AAK34396.1|
gi|13622699|gb|AAK34397.1|
gi|13622700|gb|AAK34398.1|
gi|13622701|gb|AAK34399.1|
gi|13622702|gb|AAK34400.1|
gi|13622703|gb|AAK34401.1|
gi|13622704|gb|AAK34402.1|
gi|13622705|gb|AAK34403.1|



gi|13622711|gb|AAK34408.1|
gi|13622713|gb|AAK34410.1|
gi|13622714|gb|AAK34411.1|
gi|13622715|gb|AAK34412.1|
gi|13622718|gb|AAK34414.1|
gi|13622719|gb|AAK34415.1|
gi|13622720|gb|AAK34416.1|
gi|13622721|gb|AAK34417.1|
gi|13622722|gb|AAK34418.1|
gi|13622723|gb|AAK34419.1|
gi|13622727|gb|AAK34422.1|
gi|13622728|gb|AAK34423.1|
gi|13622729|gb|AAK34424.1|
gi|13622730|gb|AAK34425.1|
gi|13622731|gb|AAK34426.1|
gi|13622733|gb|AAK34428.1|
gi|13622734|gb|AAK34429.1|
gi|13622735|gb|AAK34430.1|
gi|13622736|gb|AAK34431.1|
gi|13622737|gb|AAK34432.1|
gi|13622740|gb|AAK34434.1|
gi|13622741|gb|AAK34435.1|
gi|13622742|gb|AAK34436.1|
gi|13622744|gb|AAK34438.1|
gi|13622745|gb|AAK34439.1|
gi|13622746|gb|AAK34440.1|
gi|13622749|gb|AAK34442.1|
gi|13622750|gb|AAK34443.1|
gi|13622751|gb|AAK34444.1|
gi|13622752|gb|AAK34445.1|
gi|13622753|gb|AAK34446.1|
gi|13622754|gb|AAK34447.1|
gi|13622760|gb|AAK34452.1|
gi|13622762|gb|AAK34454.1|
gi|13622763|gb|AAK34455.1|
gi|13622764|gb|AAK34456.1|
gi|13622765|gb|AAK34457.1|
gi|13622766|gb|AAK34458.1|
gi|13622767|gb|AAK34459.1|
gi|13622768|gb|AAK34460.1|
gi|13622770|gb|AAK34462.1|
gi|13622771|gb|AAK34463.1|
gi|13622774|gb|AAK34465.1|
gi|13622775|gb|AAK34466.1|
gi|13622776|gb|AAK34467.1|
gi|13622777|gb|AAK34468.1|
gi|13622778|gb|AAK34469.1|
gi|13622779|gb|AAK34470.1|
gi|13622780|gb|AAK34471.1|
gi|13622781|gb|AAK34472.1|
gi|13622782|gb|AAK34473.1|
gi|13622783|gb|AAK34474.1|
gi|13622785|gb|AAK34475.1|
gi|13622787|gb|AAK34477.1|
gi|13622789|gb|AAK34479.1|
gi|13622790|gb|AAK34480.1|
gi|13622791|gb|AAK34481.1|
gi|13622792|gb|AAK34482.1|
gi|13622793|gb|AAK34483.1|
gi|13622794|gb|AAK34484.1|
gi|13622795|gb|AAK34485.1|
gi|13622796|gb|AAK34486.1|
gi|13622798|gb|AAK34487.1|
gi|13622799|gb|AAK34488.1|
gi|13622800|gb|AAK34489.1|
gi|13622801|gb|AAK34490.1|
gi|13622802|gb|AAK34491.1|
gi|13622803|gb|AAK34492.1|
gi|13622804|gb|AAK34493.1|
gi|13622805|gb|AAK34494.1|
gi|13622806|gb|AAK34495.1|
gi|13622807|gb|AAK34496.1|
gi|13622808|gb|AAK34497.1|
gi|13622809|gb|AAK34498.1|
gi|13622810|gb|AAK34499.1|
gi|13622812|gb|AAK34500.1|
gi|13622813|gb|AAK34501.1|
gi|13622814|gb|AAK34502.1|
gi|13622815|gb|AAK34503.1|
gi|13622818|gb|AAK34506.1|
gi|13622821|gb|AAK34509.1|
gi|13622822|gb|AAK34510.1|
gi|13622823|gb|AAK34511.1|
gi|13622825|gb|AAK34512.1|
gi|13622826|gb|AAK34513.1|
gi|13622827|gb|AAK34514.1|
gi|13622828|gb|AAK34515.1|
gi|13622829|gb|AAK34516.1|
gi|13622830|gb|AAK34517.1|
gi|13622833|gb|AAK34520.1|
gi|13622838|gb|AAK34524.1|
gi|13622839|gb|AAK34525.1|
gi|13622840|gb|AAK34526.1|
gi|13622841|gb|AAK34527.1|
gi|13622847|gb|AAK34532.1|
gi|13622848|gb|AAK34533.1|
gi|13622849|gb|AAK34534.1|
gi|13622853|gb|AAK34537.1|
gi|13622854|gb|AAK34538.1|
gi|13622856|gb|AAK34540.1|
gi|13622857|gb|AAK34541.1|
gi|13622858|gb|AAK34542.1|
gi|13622860|gb|AAK34543.1|
gi|13622861|gb|AAK34544.1|
gi|13622862|gb|AAK34545.1|
gi|13622863|gb|AAK34546.1|
gi|13622864|gb|AAK34547.1|
gi|13622865|gb|AAK34548.1|
gi|13622866|gb|AAK34549.1|
gi|13622867|gb|AAK34550.1|
gi|13622868|gb|AAK34551.1|
gi|13622869|gb|AAK34552.1|



gi|13622870|gb|AAK34553.1|
gi|13622873|gb|AAK34555.1|
gi|13622875|gb|AAK34557.1|
gi|13622876|gb|AAK34558.1|
gi|13622877|gb|AAK34559.1|
gi|13622878|gb|AAK34560.1|
gi|13622879|gb|AAK34561.1|
gi|13622880|gb|AAK34562.1|
gi|13622881|gb|AAK34563.1|
gi|13622882|gb|AAK34564.1|
gi|13622885|gb|AAK34566.1|
gi|13622886|gb|AAK34567.1|
gi|13622887|gb|AAK34568.1|
gi|13622888|gb|AAK34569.1|
gi|13622890|gb|AAK34571.1|
gi|13622893|gb|AAK34574.1|
gi|13622896|gb|AAK34576.1|
gi|13622898|gb|AAK34578.1|
gi|13622899|gb|AAK34579.1|
gi|13622900|gb|AAK34580.1|
gi|13622901|gb|AAK34581.1|
gi|13622903|gb|AAK34583.1|
gi|13622905|gb|AAK34585.1|
gi|13622906|gb|AAK34586.1|
gi|13622907|gb|AAK34587.1|
gi|13622908|gb|AAK34588.1|
gi|13622910|gb|AAK34589.1|
gi|13622911|gb|AAK34590.1|
gi|13622912|gb|AAK34591.1|
gi|13622913|gb|AAK34592.1|
gi|13622914|gb|AAK34593.1|
gi|13622915|gb|AAK34594.1|
gi|13622917|gb|AAK34596.1|
gi|13622918|gb|AAK34597.1|
gi|13622919|gb|AAK34598.1|
gi|13622921|gb|AAK34599.1|
gi|13622922|gb|AAK34600.1|
gi|13622924|gb|AAK34602.1|
gi|13622925|gb|AAK34603.1|
gi|13622926|gb|AAK34604.1|
gi|13622927|gb|AAK34605.1|
gi|13622928|gb|AAK34606.1|
gi|13622929|gb|AAK34607.1|
gi|13622930|gb|AAK34608.1|
gi|13622931|gb|AAK34609.1|
gi|13622933|gb|AAK34610.1|
gi|13622941|gb|AAK34617.1|
gi|13622944|gb|AAK34620.1|
gi|13622945|gb|AAK34621.1|
gi|13622947|gb|AAK34623.1|
gi|13622948|gb|AAK34624.1|
gi|13622949|gb|AAK34625.1|
gi|13622950|gb|AAK34626.1|
gi|13622952|gb|AAK34627.1|
gi|13622955|gb|AAK34630.1|
gi|13622956|gb|AAK34631.1|






WO 2004/018646 



PCT/US2003/026827 



Table 33: List of GAS ORFs which are shared with GBS and Spn 



gi|13622959|gb|AAK34634.1|
gi|13622961|gb|AAK34636.1|
gi|13622963|gb|AAK34638.1|
gi|13622964|gb|AAK34639.1|
gi|13622967|gb|AAK34641.1|
gi|13622969|gb|AAK34643.1|
gi|13622971|gb|AAK34645.1|
gi|13622973|gb|AAK34647.1|
gi|13622974|gb|AAK34648.1|
gi|13622977|gb|AAK34651.1|
gi|13622981|gb|AAK34654.1|
gi|13622982|gb|AAK34655.1|
gi|13622983|gb|AAK34656.1|
gi|13622984|gb|AAK34657.1|
gi|13622985|gb|AAK34658.1|
gi|13622989|gb|AAK34661.1|
gi|13622990|gb|AAK34662.1|
gi|13622991|gb|AAK34663.1|
gi|13622992|gb|AAK34664.1|
gi|13622995|gb|AAK34666.1|
gi|13622996|gb|AAK34667.1|
gi|13622998|gb|AAK34669.1|
gi|13622999|gb|AAK34670.1|
gi|13623000|gb|AAK34671.1|
gi|13623001|gb|AAK34672.1|
gi|13623002|gb|AAK34673.1|
gi|13623004|gb|AAK34674.1|
gi|13623005|gb|AAK34675.1|
gi|13623006|gb|AAK34676.1|
gi|13623007|gb|AAK34677.1|
gi|13623009|gb|AAK34679.1|
gi|13623019|gb|AAK34688.1|
gi|13623020|gb|AAK34689.1|
gi|13623030|gb|AAK34698.1|
gi|13623031|gb|AAK34699.1|
gi|13623032|gb|AAK34700.1|
gi|13623033|gb|AAK34701.1|
gi|13623038|gb|AAK34705.1|
gi|13623045|gb|AAK34712.1|
gi|13623046|gb|AAK34713.1|
gi|13623047|gb|AAK34714.1|
gi|13623049|gb|AAK34715.1|
gi|13623050|gb|AAK34716.1|
gi|13623051|gb|AAK34717.1|
gi|13623052|gb|AAK34718.1|
gi|13623053|gb|AAK34719.1|
gi|13623054|gb|AAK34720.1|
gi|13623056|gb|AAK34722.1|
gi|13623058|gb|AAK34724.1|
gi|13623062|gb|AAK34727.1|
gi|13623064|gb|AAK34729.1|
gi|13623065|gb|AAK34730.1|
gi|13623069|gb|AAK34733.1|
gi|13623074|gb|AAK34738.1|
gi|13623081|gb|AAK34744.1|
gi|13623082|gb|AAK34745.1|



gi|13623083|gb|AAK34746.1|
gi|13623085|gb|AAK34747.1|
gi|13623086|gb|AAK34748.1|
gi|13623088|gb|AAK34750.1|
gi|13623089|gb|AAK34751.1|
gi|13623090|gb|AAK34752.1|
gi|13623091|gb|AAK34753.1|
gi|13623093|gb|AAK34755.1|
gi|13623095|gb|AAK34756.1|
gi|13623096|gb|AAK34757.1|
gi|13623098|gb|AAK34759.1|
gi|13623099|gb|AAK34760.1|
gi|13623100|gb|AAK34761.1|
gi|13623102|gb|AAK34763.1|
gi|13623103|gb|AAK34764.1|
gi|13623105|gb|AAK34766.1|
gi|13623107|gb|AAK34767.1|
gi|13623128|gb|AAK34787.1|
gi|13623129|gb|AAK34788.1|
gi|13623131|gb|AAK34790.1|
gi|13623132|gb|AAK34791.1|
gi|13623133|gb|AAK34792.1|
gi|13623134|gb|AAK34793.1|
gi|13623136|gb|AAK34794.1|
gi|13623138|gb|AAK34796.1|
gi|13623139|gb|AAK34797.1|
gi|13623150|gb|AAK34807.1|
gi|13623151|gb|AAK34808.1|
gi|13623152|gb|AAK34809.1|
gi|13623154|gb|AAK34811.1|
gi|13623155|gb|AAK34812.1|
gi|13623156|gb|AAK34813.1|
gi|13623157|gb|AAK34814.1|
gi|13623159|gb|AAK34815.1|
gi|13623161|gb|AAK34817.1|
gi|13623162|gb|AAK34818.1|
gi|13623163|gb|AAK34819.1|
gi|13623165|gb|AAK34821.1|
gi|13623166|gb|AAK34822.1|
gi|13623167|gb|AAK34823.1|
gi|13623168|gb|AAK34824.1|
gi|13623170|gb|AAK34826.1|
gi|13623171|gb|AAK34827.1|
gi|13623175|gb|AAK34830.1|
gi|13623176|gb|AAK34831.1|
gi|13623177|gb|AAK34832.1|
gi|13623179|gb|AAK34834.1|
gi|13623180|gb|AAK34835.1|
gi|13623182|gb|AAK34836.1|
gi|13623183|gb|AAK34837.1|
gi|13623184|gb|AAK34838.1|
gi|13623185|gb|AAK34839.1|
gi|13623186|gb|AAK34840.1|
gi|13623187|gb|AAK34841.1|
gi|13621381|gb|AAK33195.1|
gi|13621423|gb|AAK33233.1|
gi|13621440|gb|AAK33249.1|
gi|13621443|gb|AAK33251.1|
gi|13621453|gb|AAK33260.1|
gi|13621454|gb|AAK33261.1|
gi|13621479|gb|AAK33284.1|
gi|13621482|gb|AAK33287.1|
gi|13621492|gb|AAK33296.1|
gi|13621493|gb|AAK33297.1|
gi|13621497|gb|AAK33300.1|
gi|13621498|gb|AAK33301.1|
gi|13621512|gb|AAK33314.1|
gi|13621514|gb|AAK33316.1|
gi|13621556|gb|AAK33354.1|
gi|13621570|gb|AAK33366.1|
gi|13621587|gb|AAK33382.1|
gi|13621610|gb|AAK33403.1|
gi|13621613|gb|AAK33405.1|
gi|13621626|gb|AAK33418.1|
gi|13621632|gb|AAK33423.1|
gi|13621635|gb|AAK33426.1|
gi|13621643|gb|AAK33433.1|
gi|13621655|gb|AAK33444.1|
gi|13621656|gb|AAK33445.1|
gi|13621659|gb|AAK33448.1|
gi|13621673|gb|AAK33461.1|
gi|13621686|gb|AAK33473.1|
gi|13621696|gb|AAK33482.1|
gi|13621703|gb|AAK33488.1|
gi|13621712|gb|AAK33497.1|
gi|13621728|gb|AAK33511.1|
gi|13621738|gb|AAK33520.1|
gi|13621739|gb|AAK33521.1|
gi|13621740|gb|AAK33522.1|
gi|13621772|gb|AAK33551.1|
gi|13621776|gb|AAK33555.1|
gi|13621791|gb|AAK33569.1|
gi|13621798|gb|AAK33575.1|
gi|13621801|gb|AAK33578.1|
gi|13621803|gb|AAK33580.1|
gi|13621804|gb|AAK33581.1|
gi|13621832|gb|AAK33606.1|
gi|13621833|gb|AAK33607.1|
gi|13621896|gb|AAK33665.1|
gi|13621897|gb|AAK33666.1|
gi|13621906|gb|AAK33674.1|
gi|13621911|gb|AAK33679.1|
gi|13621949|gb|AAK33713.1|
gi|13621951|gb|AAK33715.1|
gi|13621962|gb|AAK33724.1|
gi|13621963|gb|AAK33725.1|
gi|13621964|gb|AAK33726.1|
gi|13621971|gb|AAK33732.1|
gi|13621976|gb|AAK33737.1|
gi|13621983|gb|AAK33744.1|



gi|13621988|gb|AAK33748.1|
gi|13622014|gb|AAK33772.1|
gi|13622015|gb|AAK33773.1|
gi|13622022|gb|AAK33779.1|
gi|13622023|gb|AAK33780.1|
gi|13622028|gb|AAK33784.1|
gi|13622029|gb|AAK33785.1|
gi|13622037|gb|AAK33792.1|
gi|13622038|gb|AAK33793.1|
gi|13622040|gb|AAK33795.1|
gi|13622057|gb|AAK33811.1|
gi|13622061|gb|AAK33814.1|
gi|13622063|gb|AAK33816.1|
gi|13622066|gb|AAK33819.1|
gi|13622067|gb|AAK33820.1|
gi|13622076|gb|AAK33828.1|
gi|13622078|gb|AAK33830.1|
gi|13622084|gb|AAK33835.1|
gi|13622098|gb|AAK33848.1|
gi|13622099|gb|AAK33849.1|
gi|13622100|gb|AAK33850.1|
gi|13622104|gb|AAK33854.1|
gi|13622110|gb|AAK33859.1|
gi|13622116|gb|AAK33865.1|
gi|13622124|gb|AAK33873.1|
gi|13622159|gb|AAK33905.1|
gi|13622193|gb|AAK33936.1|
gi|13622194|gb|AAK33937.1|
gi|13622195|gb|AAK33938.1|
gi|13622196|gb|AAK33939.1|
gi|13622202|gb|AAK33944.1|
gi|13622203|gb|AAK33945.1|
gi|13622206|gb|AAK33948.1|
gi|13622210|gb|AAK33951.1|
gi|13622221|gb|AAK33961.1|
gi|13622231|gb|AAK33971.1|
gi|13622234|gb|AAK33973.1|
gi|13622238|gb|AAK33977.1|
gi|13622245|gb|AAK33984.1|
gi|13622246|gb|AAK33985.1|
gi|13622248|gb|AAK33986.1|
gi|13622249|gb|AAK33987.1|
gi|13622251|gb|AAK33989.1|
gi|13622254|gb|AAK33992.1|
gi|13622267|gb|AAK34004.1|
gi|13622291|gb|AAK34026.1|
gi|13622302|gb|AAK34036.1|
gi|13622303|gb|AAK34037.1|
gi|13622304|gb|AAK34038.1|
gi|13622327|gb|AAK34059.1|
gi|13622344|gb|AAK34074.1|
gi|13622345|gb|AAK34075.1|
gi|13622346|gb|AAK34076.1|
gi|13622347|gb|AAK34077.1|
gi|13622348|gb|AAK34078.1|
gi|13622349|gb|AAK34079.1|
gi|13622382|gb|AAK34109.1|    gi|13622816|gb|AAK34504.1|
gi|13622386|gb|AAK34113.1|    gi|13622817|gb|AAK34505.1|
gi|13622391|gb|AAK34118.1|    gi|13622846|gb|AAK34531.1|
gi|13622392|gb|AAK34119.1|    gi|13622852|gb|AAK34536.1|
gi|13622397|gb|AAK34123.1|    gi|13622874|gb|AAK34556.1|
gi|13622404|gb|AAK34129.1|    gi|13622889|gb|AAK34570.1|
gi|13622412|gb|AAK34136.1|    gi|13622891|gb|AAK34572.1|
gi|13622413|gb|AAK34137.1|    gi|13622892|gb|AAK34573.1|
gi|13622414|gb|AAK34138.1|    gi|13622897|gb|AAK34577.1|
gi|13622418|gb|AAK34142.1|    gi|13622902|gb|AAK34582.1|
gi|13622430|gb|AAK34152.1|    gi|13622904|gb|AAK34584.1|
gi|13622446|gb|AAK34167.1|    gi|13622916|gb|AAK34595.1|
gi|13622449|gb|AAK34169.1|    gi|13622923|gb|AAK34601.1|
gi|13622453|gb|AAK34173.1|    gi|13622934|gb|AAK34611.1|
gi|13622470|gb|AAK34188.1|    gi|13622953|gb|AAK34628.1|
gi|13622487|gb|AAK34204.1|    gi|13622954|gb|AAK34629.1|
gi|13622490|gb|AAK34206.1|    gi|13622960|gb|AAK34635.1|
gi|13622502|gb|AAK34217.1|    gi|13622968|gb|AAK34642.1|
gi|13622503|gb|AAK34218.1|    gi|13622980|gb|AAK34653.1|
gi|13622514|gb|AAK34228.1|    gi|13622987|gb|AAK34659.1|
gi|13622528|gb|AAK34241.1|    gi|13623012|gb|AAK34682.1|
gi|13622540|gb|AAK34252.1|    gi|13623013|gb|AAK34683.1|
gi|13622541|gb|AAK34253.1|    gi|13623014|gb|AAK34684.1|
gi|13622544|gb|AAK34255.1|    gi|13623015|gb|AAK34685.1|
gi|13622545|gb|AAK34256.1|    gi|13623016|gb|AAK34686.1|
gi|13622546|gb|AAK34257.1|    gi|13623018|gb|AAK34687.1|
gi|13622547|gb|AAK34258.1|    gi|13623022|gb|AAK34691.1|
gi|13622548|gb|AAK34259.1|    gi|13623029|gb|AAK34697.1|
gi|13622550|gb|AAK34261.1|    gi|13623037|gb|AAK34704.1|
gi|13622551|gb|AAK34262.1|    gi|13623055|gb|AAK34721.1|
gi|13622552|gb|AAK34263.1|    gi|13623060|gb|AAK34725.1|
gi|13622556|gb|AAK34267.1|    gi|13623061|gb|AAK34726.1|
gi|13622557|gb|AAK34268.1|    gi|13623063|gb|AAK34728.1|
gi|13622558|gb|AAK34269.1|    gi|13623066|gb|AAK34731.1|
gi|13622559|gb|AAK34270.1|    gi|13623068|gb|AAK34732.1|
gi|13622563|gb|AAK34273.1|    gi|13623092|gb|AAK34754.1|
gi|13622571|gb|AAK34281.1|    gi|13623097|gb|AAK34758.1|
gi|13622576|gb|AAK34286.1|    gi|13623104|gb|AAK34765.1|
gi|13622581|gb|AAK34290.1|    gi|13623126|gb|AAK34785.1|
gi|13622582|gb|AAK34291.1|    gi|13623130|gb|AAK34789.1|
gi|13622586|gb|AAK34295.1|    gi|13623137|gb|AAK34795.1|
gi|13622589|gb|AAK34298.1|    gi|13623153|gb|AAK34810.1|
gi|13622605|gb|AAK34312.1|    gi|13623164|gb|AAK34820.1|
gi|13622633|gb|AAK34338.1|    gi|13623178|gb|AAK34833.1|
gi|13622635|gb|AAK34340.1|
gi|13622637|gb|AAK34342.1|
gi|13622638|gb|AAK34343.1|
gi|13622657|gb|AAK34360.1|
gi|13622707|gb|AAK34404.1|
gi|13622716|gb|AAK34413.1|
gi|13622724|gb|AAK34420.1|
gi|13622732|gb|AAK34427.1|
gi|13622743|gb|AAK34437.1|




gil 
gil 


13622761 |gb|AAK34453.1 1 
13622773|gb|AAK34464.1 1 




gil 


1 3622788|gb|AAK34478.1 1 
Table 35: GAS ORFs which are shared with pneumococcus but not with GBS

gi|13621338|gb|AAK33157.1|
gi|13621352|gb|AAK33168.1|
gi|13621410|gb|AAK33221.1|
gi|13621433|gb|AAK33242.1|
gi|13621445|gb|AAK33253.1|
gi|13621446|gb|AAK33254.1|
gi|13621447|gb|AAK33255.1|
gi|13621448|gb|AAK33256.1|
gi|13621449|gb|AAK33257.1|
gi|13621451|gb|AAK33259.1|
gi|13621460|gb|AAK33267.1|
gi|13621466|gb|AAK33272.1|
gi|13621489|gb|AAK33293.1|
gi|13621490|gb|AAK33294.1|
gi|13621519|gb|AAK33320.1|
gi|13621520|gb|AAK33321.1|
gi|13621653|gb|AAK33443.1|
gi|13621722|gb|AAK33506.1|
gi|13621723|gb|AAK33507.1|
gi|13621724|gb|AAK33508.1|
gi|13621805|gb|AAK33582.1|
gi|13621900|gb|AAK33669.1|
gi|13622011|gb|AAK33769.1|
gi|13622212|gb|AAK33953.1|
gi|13622280|gb|AAK34016.1|
gi|13622381|gb|AAK34108.1|
gi|13622409|gb|AAK34134.1|
gi|13622410|gb|AAK34135.1|
gi|13622423|gb|AAK34146.1|
gi|13622428|gb|AAK34151.1|
gi|13622441|gb|AAK34162.1|
gi|13622442|gb|AAK34163.1|
gi|13622454|gb|AAK34174.1|
gi|13622456|gb|AAK34176.1|
gi|13622619|gb|AAK34325.1|
gi|13622642|gb|AAK34346.1|
gi|13622643|gb|AAK34347.1|
gi|13622664|gb|AAK34366.1|
gi|13622666|gb|AAK34368.1|
gi|13622667|gb|AAK34369.1|
gi|13622671|gb|AAK34372.1|
gi|13622672|gb|AAK34373.1|
gi|13622673|gb|AAK34374.1|
gi|13622674|gb|AAK34375.1|
gi|13622679|gb|AAK34380.1|
gi|13622680|gb|AAK34381.1|
gi|13622682|gb|AAK34382.1|
gi|13622755|gb|AAK34448.1|
gi|13622758|gb|AAK34450.1|
gi|13622759|gb|AAK34451.1|
gi|13622835|gb|AAK34521.1|
gi|13622837|gb|AAK34523.1|
gi|13622937|gb|AAK34614.1|
gi|13622942|gb|AAK34618.1|
gi|13622946|gb|AAK34622.1|
gi|13622978|gb|AAK34652.1|
gi|13623027|gb|AAK34695.1|
gi|13623087|gb|AAK34749.1|
gi|13623101|gb|AAK34762.1|
gi|13623144|gb|AAK34802.1|
gi|13623146|gb|AAK34804.1|
gi|13623147|gb|AAK34805.1|
Table 36: Spn ORFs which are shared with GBS and GAS



SP0001 


SP0158 


SP0254 


SP0385 


SP0002 


SP0173 


SP0259 


SP0386 


SP0003 


SP0179 


SP0261 


SP0387 


SP0004 


SP0180 


SP0262 


SP0400 


SP0005 


SP0184 


SP0263 


SP0401 


SP0006 


SP0185 


SP0264 


SP0402 


SP0007 


SP0186 


SP0265 


SP0403 


SP0008 


SP0187 


SP0266 


SP0404 


SP0010 


SP0189 


SP0268 


SP0405 


SP0011


SP0192 


SP0271 


SP0406 


SP0013 


SP0194 


SP0272 


SP0408 


SP0014 


SP0197 


SP0273 


SP0410 


SP0019 


SP0199 


SP0274 


SP0411


SP0021 


SP0202 


SP0280 


SP0412 


SP0024 


SP0204 


SP0281 


SP0415 


SP0027 


SP0205 


SP0282 


SP0416 


SP0032 


SP0208 


SP0283 


SP0417 


SP0033 


SP0209 


SP0284 


SP0418 


SP0034 


SP0210 


SP0285 


SP0419 


SP0035 


SP0211 


SP0286 


SP0420 


SP0036 


SP0212 


SP0287 


SP0421 


SP0037 


SP0213 


SP0289 


SP0422 


SP0042 


SP0214 


SP0290 


SP0423 


SP0044 


SP0215 


SP0291 


SP0424 


SP0045 


SP0216 


SP0292 


SP0425 


SP0046 


SP0217 


SP0294 


SP0426 


SP0047 


SP0218 


SP0295 


SP0427 


SP0048 


SP0219 


SP0303 


SP0433 


SP0051 


SP0220 


SP0310 


SP0434 


SP0053 


SP0221 


SP0314 


SP0435 


SP0054 


SP0222 


SP0317 


SP0436 


SP0056 


SP0224 


SP0318 


SP0437 


SP0063 


SP0225 


SP0319 


SP0438 


SP0073 


SP0226 


SP0320 


SP0439 


SP0074 


SP0227 


SP0321 


SP0441 


SP0078 


SP0228 


SP0322 


SP0442 


SP0079 


SP0229 


SP0323 


SP0443 


SP0083 


SP0230 


SP0324 


SP0452 


SP0084 


SP0231 


SP0325 


SP0453 


SP0085 


SP0232 


SP0327 


SP0454 


SP0095 


SP0233 


SP0330 


SP0457 


SP0105 


SP0234 


SP0334 


SP0458 


SP0106 


SP0235 


SP0336 


SP0459 


SP0111


SP0236 


SP0337 


SP0461 


SP0112 


SP0240 


SP0338 


SP0466 


SP0118 


SP0242 


SP0340 


SP0467 


SP0120 


SP0243 


SP0342 


SP0474 


SP0121 


SP0245 


SP0369 


SP0477 


SP0122 


SP0246 


SP0370 


SP0478 


SP0127 


SP0247 


SP0371 


SP0483 


SP0128 


SP0248 


SP0373


SP0486 


SP0129 


SP0249 


SP0374 


SP0488 


SP0148 


SP0250 


SP0381 


SP0489 


SP0149 


SP0251 


SP0382 


SP0493 


SP0151 


SP0252 


SP0383 


SP0494 


SP0152 


SP0253 


SP0384 


SP0499 



SP0500 


SP0652 


SP0787 


SP0895 


SP0501 


SP0657 


SP0788 


SP0896 


SP0502 


SP0660 


SP0792 


SP0897 


SP0515 


SP0662 


SP0793 


SP0904 


SP0516 


SP0663 


SP0797 


SP0905 


SP0517 


SP0665 


SP0798 


SP0908 


SP0519 


SP0668 


SP0799 


SP0909 


SP0521 


SP0669 


SP0801 


SP0912 


SP0522 


SP0671 


SP0802 


SP0923 


SP0523 


SP0672 


SP0803 


SP0927 


SP0526 


SP0673 


SP0805 


SP0928 


SP0549 


SP0674 


SP0806 


SP0929 


SP0550 


SP0675 


SP0807 


SP0931 


SP0552 


SP0676 


SP0816 


SP0932 


SP0553 


SP0678 


SP0817 


SP0933 


SP0554 


SP0680 


SP0820 


SP0935 


SP0555 


SP0681


SP0822 


SP0936 


SP0556 


SP0687 


SP0823 


SP0937 


SP0557 


SP0688 


SP0824 


SP0938 


SP0563 


SP0689 


SP0825 


SP0943 


SP0567 


SP0690 


SP0828 


SP0944 


SP0568 


SP0701 


SP0829 


SP0945 


SP0576 


SP0702 


SP0831 


SP0946 


SP0577 


SP0709 


SP0835 


SP0947 


SP0578 


SP0713 


SP0837 


SP0948 


SP0579 


SP0726 


SP0838 


SP0954 


SP0581 


SP0727 


SP0839 


SP0955 


SP0588 


SP0729 


SP0841 


SP0959 


SP0589 


SP0735 


SP0843 


SP0960 


SP0591 


SP0736 


SP0844 


SP0961 


SP0592 


SP0741 


SP0845 


SP0962 


SP0593 


SP0744 


SP0846 


SP0964 


SP0603 


SP0745 


SP0847 


SP0966 


SP0604 


SP0746 


SP0848 


SP0967 


SP0605 


SP0756 


SP0851 


SP0968 


SP0608 


SP0757 


SP0852 


SP0969 


SP0610 


SP0758 


SP0855 


SP0970 


SP0611


SP0760 


SP0856 


SP0971 


SP0613 


SP0761 


SP0862 


SP0972 


SP0614 


SP0762 


SP0864 


SP0974 


SP0615 


SP0764 


SP0865 


SP0975 


SP0616 


SP0765 


SP0867 


SP0976 


SP0618 


SP0766 


SP0868 


SP0978 


SP0620 


SP0767 


SP0869 


SP0979 


SP0622 


SP0768 


SP0870 


SP0980 


SP0623 


SP0770 


SP0871 


SP0981 


SP0624 


SP0771 


SP0872 


SP0984 


SP0626 


SP0775 


SP0873 


SP0985 


SP0630 


SP0776 


SP0875 


SP0987 


SP0631 


SP0778 


SP0876 


SP0988 


SP0636 


SP0779 


SP0877 


SP0989 


SP0637 


SP0780 


SP0878 


SP0991 


SP0638 


SP0782 


SP0880 


SP0992 


SP0645 


SP0784 


SP0881 


SP0993 


SP0646 


SP0785 


SP0893 


SP1002 


SP0647 


SP0786 


SP0894 


SP1003 



SP1004 


SP1117 


SP1242 


SP1387 


SP1008 


SP1118 


SP1244 


SP1388 


SP1010 


SP1119 


SP1245 


SP1389 


SP1012 


SP1128 


SP1246 


SP1390 


SP1016 


SP1151 


SP1247 


SP1393 


SP1017 


SP1152 


SP1248 


SP1394 


SP1018 


SP1155 


SP1249 


SP1395 


SP1020 


SP1156 


SP1260 


SP1396 


SP1021 


SP1157 


SP1263 


SP1397 


SP1022 


SP1159 


SP1266 


SP1398 


SP1024 


SP1160 


SP1275 


SP1399 


SP1025 


SP1161 


SP1276 


SP1400 


SP1026 


SP1162 


SP1277 


SP1402 


SP1029 


SP1163 


SP1278 


SP1403 


SP1033 


SP1164 


SP1279 


SP1404 


SP1034 


SP1167 


SP1280 


SP1405 


SP1035 


SP1168 


SP1283 


SP1406 


SP1045 


SP1169 


SP1284 


SP1407 


SP1056 


SP1174 


SP1285 


SP1408 


SP1067 


SP1175 


SP1286 


SP1409 


SP1068 


SP1176 


SP1287 


SP1411 


SP1069 


SP1177 


SP1288 


SP1412 


SP1070 


SP1178 


SP1289 


SP1413 


SP1071 


SP1179 


SP1290 


SP1414 


SP1072 


SP1180 


SP1291 


SP1415 


SP1073 


SP1182 


SP1293 


SP1416 


SP1074 


SP1184 


SP1297 


SP1420 


SP1076 


SP1185 


SP1298 


SP1421 


SP1079 


SP1187 


SP1299 


SP1427 


SP1081 


SP1190 


SP1308 


SP1428 


SP1082 


SP1191 


SP1316 


SP1429 


SP1083 


SP1192 


SP1324 


SP1434 


SP1084 


SP1193 


SP1329 


SP1435 


SP1087 


SP1197 


SP1330 


SP1445 


SP1088 


SP1200 


SP1331 


SP1446 


SP1089 


SP1202 


SP1336 


SP1448 


SP1090 


SP1204 


SP1341 


SP1449 


SP1093 


SP1205 


SP1354 


SP1450 


SP1094 


SP1207 


SP1355 


SP1452 


SP1095 


SP1208 


SP1357 


SP1453 


SP1096 


SP1212 


SP1358 


SP1456 


SP1097 


SP1213 


SP1359 


SP1457 


SP1098 


SP1218 


SP1362 


SP1458 


SP1099 


SP1219 


SP1368 


SP1460 


SP1100 


SP1220 


SP1370 


SP1461 


SP1102 


SP1225 


SP1371 


SP1462 


SP1105 


SP1226 


SP1372 


SP1465 


SP1106 


SP1227 


SP1374 


SP1466 


SP1107 


SP1228 


SP1375 


SP1469 


SP1110 


SP1229 


SP1376 


SP1470 


SP1111 


SP1230 


SP1377 


SP1473 


SP1112 


SP1231 


SP1378 


SP1474 


SP1113 


SP1232 


SP1380 


SP1475 


SP1114 


SP1233 


SP1381 


SP1478 


SP1115 


SP1238 


SP1383 


SP1479 


SP1116 


SP1241 


SP1386 


SP1482 



SP1483 


SP1580 


SP1685 


SP1857 


SP1485 


SP1583 


SP1688 


SP1858 


SP1489 


SP1584 


SP1689 


SP1860 


SP1491 


SP1586 


SP1697 


SP1861 


SP1498 


SP1587 


SP1698 


SP1865 


SP1500 


SP1588 


SP1699 


SP1871 


SP1501 


SP1589 


SP1702 


SP1873 


SP1502 


SP1590 


SP1709 


SP1874 




SP1591 


SP1711 


SP1875 




SP1597 


SP1712 


SP1876 


SP1507 


SP1598 


SP1713 


SP1877 




SP1599 


SP1714 


SP1878 


55P1BOQ 


SP1602 


SP1717 


SP1879 


SP1510 


SP1603 


SP1721 


SP1880 


SP1511


SP1606 


SP1722 


SP1881 


SP1512 


SP1608 


SP1724 


SP1883 


SP1513 


SP1609 

Wl 1 VJUC7 


SP1725 


SP1884 


SP1517 


SP1610 


SP1726 


SP1887 


SP1518 


SP1615 


SP1727 


SP1888 


SP1519 


SP1616 


SP1732 


SP1889 


SP1521 


SP1617 


SP1733 


SP1890 


SP1522 


SP1624 


SP1734 


SP1895 


SP1523 


SP1625 


SP1735 


SP1896 


SP1529 


SP1626 


SP1736 


SP1900 


SP1530 


SP1631 


SP1737 


SP1901 


SP1534 


SP1633 


SP1738 


SP1902 


SP1535 


SP1638 


SP1739 


SP1903 


SP1536 


SP1644 


SP1742 


SP1906 


SP1537 


SP1645 


SP1743 


SP1908 


SP1538 


SP1646 


SP1744 


SP1909 


SP1539 


SP1647 


SP1746 


SP1916 


SP1540 


SP1648 


SP1747 


SP1918 


SP1541 


SP1649 


SP1748 


SP1922 


SP1542 


SP1650 


SP1749 


SP1940 


SP1544 


SP1652 


SP1750 


SP1942 


SP1547 


SP1653 


SP1752 


SP1944 


SP1549 


SP1655 


SP1759 


SP1953 


SP1551 


SP1659 


SP1776 


SP1957 


SP1552 


SP1661 


SP1780 


SP1960 


SP1553 


SP1662 


SP1781 


SP1961 


SP1554 


SP1664 


SP1782 


SP1963 


SP1557 


SP1665 


SP1785 


SP1964 


SP1558 


SP1666 


SP1790 


SP1966 


SP1559 


SP1667 


SP1795 


SP1967 


SP1560 


SP1668 


SP1799 


SP1968 


SP1561 


SP1670 


SP1804 


SP1969 


SP1563 


SP1671 


SP1816 


SP1970 


SP1564 


SP1672 


SP1817 


SP1972 


SP1565 


SP1674 


SP1825 


SP1973 


SP1566 


SP1675 


SP1839 


SP1974 


SP1568 


SP1676 


SP1840 


SP1975 


SP1569 


SP1677 


SP1845 


SP1976 


SP1571 


SP1681 


SP1847 


SP1979 


SP1574 


SP1682 


SP1848 


SP1980 


SP1575 


SP1683 


SP1851 


SP1981 


SP1577 


SP1684 


SP1855 


SP1982 



SP1983 


SP2085 


SP2206 


SP1984 


SP2086 


SP2207 


SP1985 


SP2087 


SP2208 


SP1987 


SP2088 


SP2209 




SP2090 


SP2210 


or i <j\j\j 


SP2091 


SP2214 


Or 1 5757 1 


or c~\j<j C- 


SP2215 


qp<|QQO 
Or 1 57570 


Or ^V757*T 


SP2216 


Or 1 5757*t 


Or £.\J\3<D 


SP2219 


or i yyo 


or 1 v/vs 


SP2220 


QP1QQ7 
Or I 5757 / 


9P9101 
or <c 1 yj 1 


SP2221 


Or I yso 


O I 1 «J VJ 


SP2222 


Or I 575757 


0 r <c. 1 \j t 


SP2224 


OrZUUO 


Or t 1 UO 


SP2225 




or £. 1 \j<j 


SP2226 


oponi n 

OrcU 1 U 


0 r <c_ 1 1 \j 


SP2227 


or ^.l/ 1 t 


SP2112

i or 1 it. 


SP2228 


^P^OI? 
Ortu 1 ^ 


SP2113


SP2229 


SP2020 


SP2114 


SP2230 


55P9021 


SP2119 


SP2231 


Or £.\J£-£- 


SP2121 


SP2233 


or 4U& 1 


SP2129 


SP2234 


55P9028 
or 


SP2131 


SP2235 


Or lLL/v_/W 


SP2135 


SP2238 


SP2031 


SP2142 


SP2239 


or /-\j\j£- 


SP2148 


SP2240 


or <uujo 


SP2150 




^P9iv*4 

ort-v/Ot 


SP2151 

or a. 1 w 1 




^P^fi^ 
or^uoj 


SP2152 




0 r jl\J\3\j 


SP2153






or id. 1 vw 




or xuuo 


SP2161 






SP2162 

or ^- 1 v->^_ 




ortu^ 1 


SP2169 




SP2042 


SP2170 




or v/*t*t 


SP2171 




SP2045 


SP2172 




SP2048 


SP2173 




SP2052 


SP2174 




SP2053 


SP2175 




SP2054 


SP2176 




SP2055 


SP2184 




SP2056 


SP2185 




SP2057 


SP2186 




SP2058 


SP2187 




SP2063 


SP2188 




SP2065 


SP2189 




SP2069 


SP2191 




SP2070 


SP2192 




SP2072 


SP2193 




SP2073 


SP2194 




SP2075 


SP2195 




SP2077 


SP2202 




SP2078 


SP2203 




SP2082 


SP2204 




SP2083 


SP2205 
Table 37: Spn ORFs which are shared with GBS but not with GAS



Qpnm 9 

OrUU 1 


opo79^ 
OrU/ £.Q 




OrU / OU 


QPfiflOQ 


OrU/ OS 


orUUOU 


orU/4y 


orUUo2 


qda7ha 


QDA* A7 

oroi u/ 


QDA7K4 

orO/bl 


oPOl 1o 


SP0752 




SP07OO 


oP014o 


SP0754 


o r*»o a cc\ 

SP0150 


SP0769 


SP0175 


SP0789 


SP0176 


SP0791 


SP0177 


SP0826 


SP0178 


SP0900 


oP02o/ 


oioaah o 

SP0913 


SP0255 


SP0914 


oP02bO 


oP09o9 


oPOZb/ 


on a a a a 
SP0941


CDrt07Q 

orU<:/o 


SP0942


orUZoo 


oPuyoo 


oPOo4b 


opuy/ o 


orUo4/ 


QDAQ77 


orOo4o 


CD-I AH 4 

orlOl 1 


QDAQ/1 Q 

orOo4y 


CD-I ah 
orlOl O 


orUooD 


QDH 007 


ODf\Q7fi 

orUo/D 


orl 0b4 


ODAvl H O 

oP041 o 


ODH ACC 


oP044o 


ODH AQ A 

oPIOoO 


oP04b2 


OPIUOO 


SP0463 


SP1121


SP0479 


SP1122


SP0480


o r~> H h oo 
SP1123


SP0482


CD-I HO/! 

SP1124


QD A/1 Q A 

oP04o4 


orl 12D 


ODHK17 

orUOo/ 


QDH H 07 

ori 12 / 


orUOoo 


CD-I A r>~7 


orObbb 


orl Ibb 


QDflRftft 

orUoou 


QD-i A 7Q 

orl 1 1 0 


orUOoo 


or 1 iy^- 


oruoyy 


or 1 lao 


OrUDUU 


QPHOH c 

or 1 £. \ o 


QPHfifH 
OrUDUT 


opi OylA 

or iZH-u 


orUDUO 


QP1 OCR 

or ii£oo 


QPOfiOT 
OrUOU f 


or 1 cSj \ 


QPflfifiQ 
oruouy 


QP1971 
or 1 £- i 1 


SP0617 


SP1272 


SP0627 


SP1273 


SP0655 


SP1274 


SP0656 


SP1306 


SP0710 


SP1310 


SP0711


SP1332 


SP0717 


SP1333 


SP0718 


SP1334 


SP0720 


SP1346 


SP0723 


SP1348 


SP0724 


SP1350 



qpn OCA 
or i oou 


QP1Q07 

or iyz/ 


QpH Q«H 

or 1 00 1 


QDH QOft 


or 1 000 


QD-1 CkAl 

oriy4o 


QP-1 OQO 

or 1 00/ 


QDH ACQ 

or iyby 


Or 1 ot5*f 


Q DO A A H 

or2UUl 


QD'l OAO 


QDOAAO 

or 2002 


QD'l A A7 

oP144f 


SP2009 


QDH vl CH 

oP14o1 


SP2026 


oP14bo 


SP2029 


QDH A CZA 

oP14b4 


SP2039 


QDH A~7 A 
OP14/ 1 


SP2061 


ODH A 70 
OP14/2 


SP2064


SP1524


SP2066 


ODHCOT 1 

oP1o27 


SP2079 


QDH CAA 

oPIoOO 


SP2084 


QDH CAR 

oPIbUo 


SP2095 


QDH Cn7 

oPlbO/ 


SP2096


QDH COO 

orl bo2 


OnOAAO 

SP2098 


QDH RQ^ 


0 P50 h no 

SP2100


QDH CKH 

orl bbl 


QDOH OT 

SP2127 


QDH C70 

orlb/o 


OOO H O O 

oP212o 


QDH CQA 

or 1 boO 


QDOH OA 
OP21o0 


QDH CQC 

oPIbyo 


ODOH O A 

oP21o4 


QD'l 7AA 

or I / UU 


QDOH Q7 

or21o/ 


QD'l 7A*1 

orl / ul 


QDOH Q Q 

or21oo 


QDH 70n 

or 1 / ZU 


QDOH K7 

or21b/ 


ODH *70A 

orl /Zy 


SP2190 


QDH 7 A A 

orl / 4U 




QDH ~7 A H 

orl /41 




SP1745




QDH 7CH 

orl /ol 




QDH 7C7 
OPI / O/ 




QDH 7CQ 

orl /bo 




QDH 7CH 

orl Abl 




orl / b2 




QDH 7CO 

orl foo 




orl / b4 




orl /Ob 




or 1 /OO 




QPi7ft7 
orl / Of 




QPi 7Rft 

or 1 / 00 




QP177H 
or I / /U 




QPH 77H 

or J / / 1 




qph 770 
or \ 1 1 c. 




QP*1 7ftO 

or 1 / 00 




SP1802




SP1828 




SP1856 




SP1867 




SP1869 




SP1870 




SP1872 




SP1891 




SP1907 




SP1910 




SP1911 
PCT/US2003/026827
SP0065

SP0075 

SP0090 

SP0091 

SP0092 

SP0099 

SP0100 

SP0153 

SP0155 

SP0156 

SP0200 

SP0306 

SP0313 

SP0341 

SP0476 

SP0496 

SP0509 

SP0527 

SP0648 

SP0658 

SP0659 

SP0661 

SP0677 

SP0715 

SP0742 

SP0743 

SP0858 

SP0859 

SP0860 

SP0910 

SP0986 

SP0994 

SP0999 

SP1000 

SP1001 

SP1023 

SP1075 

SP1129 

SP1147 

SP1171 

SP1186 

SP1315 

SP1317 

SP1319 

SP1320 

SP1321 

SP1322 

SP1438 

SP1442 

SP1525 

SP1546 

SP1570 

SP1572 

SP1578 

SP1604 

SP1715 



SP1754 
SP1797 
SP1798 
SP1800 
SP1885 
SP1919 
SP1923 
SP1941 
SP1950 
SP2016 
SP2017 
SP2051 
SP2060 
SP2111
SP2143 
SP2144 
SP2201 
SP2236 
Table 40: Comparative Sequences relating to SAG0635



SEQ ID NO 4001: SAG0653 FROM THE 2603 V/R GBS TYPE V STRAIN
ATGAAGAAAGTGTTAGTGAGTAGTCTTTTGGTTTTAGGGATTACGATA
ACGTTACAAACAGTAGTTGAGGCTAAGGGGCCAAAAGTAGCTTATACACAAGAGGGAATG
ACTGCTCTTTCGGACACAAATAAAGATAAAGTCACTACTATTTCTATTGACGAGATTCAA
AAAAGCTTAGAAGGTAAGAAGCCGATTACTGTTAGTTTTGATATTGATGATACACTGCTT
TTCAGTAGTCAATATTTTCAATATGGTAAAGAATATGTAACTCCTGGATCGTTTGATTTT
CTTCATAAACAAAAATTCTGGGATCTTGTTGCAAAACGAGGAGATCAAGATTCCATTCCC
AAAGAATATGCTAAAAAATTAATTGCTATGCATCAAAAACGAGGAGATAAAATTGTTTTT
ATAACAGGTAGGACAAGAGGGTCAATGTATAAGGAGGGCGAGGTTGATAAAACAGCTAAA
GCCTTAGCTAAAGATTTTAAATTAGACAAACCAATTGCTGTAAATTATACAGGCGATAAA
CCTAAAAAGCCATACAAATATGATAAATCATATTATATTAAGAAATATGGTTCAGACATT
CATTATGGAGATAGTGATGACGATATTCATGCAGCTAGGGAGGCCGGTGCTAGACCAATT
AGAATTTTAAGAGCACCTAATTCTACAAATCTACCTTTACCAGAAGCTGGAGGCTACGGT
GAAGAGGTTCTCGAAAATTCAGCTTAC

SEQ ID NO 4002: SAG0653 FROM THE 090 GBS TYPE III STRAIN
AAGGGGCCAAAAGTAGCTTATACACAAGAGGGAATGAC
TGCTCTTTCGGACACAAATAAAGATAAAGTCACTACTATTTCTATTGACG
AGATTCAAAAAAGCTTAGAAGGTAAGAAGCCGATTACTGTTAGTTTTGAT
ATTGATGATACACTACTTTTCAGTAGTCAATATTTTCAATATGGTAAAGA
ATATGTAACTCCTGGATCGTTTGATTTTCTTCATAAACAAAAATTCTGGG
ATCTTGTTGCAAAACGAGGAGATCAAGATTCCATTCCCAAAGAATATGCT
AAAAAATTAATTGCTATGCATCAAAAACGAGGAGATAAAATTGTTTTTAT
AACAGGTAGGACAAGAGGGTCAATGTATAAGGAGGGCGAGGTTGATAAAA
CAGCTAAAGCCTTAGCTAAAGATTTTAAATTAGACAAACCAATTGCTGTA
AATTATACAGGCGATAAACCTAAAAAGCCATACAAATATGATAAATCATA
TTATATTAAGAAATATGGTTCAGACATTCATTATGGAGATAGTGATGACG
ATATTCATGCAGCTAGGGAGGCCGGTGCTAGACCAATTAGAATTTTAAGA
GCACCTAATTCTACAAATCTACCTTTACCAGAAGCTGGAGGCTACGGTGA
AGAGGTTCTCGAAAATTCAGCTTAC

SEQ ID NO 4003: SAG0653 FROM THE A909 GBS TYPE Ia STRAIN
AAGGGGCCAAAAGTAGCTTATACACA
AGAGGGAATGACTGCTCTTTCGGACACAAATAAAGATAAAGTCACTACTA
TTTCTATTGACGAGATTCAAAAAAGCTTAGAAGGTAAGAAGCCGATTACT
GTTAGTTTTGATATTGATGATACACTGCTTTTCAGTAGTCAATATTTTCA
ATATGGTAAAGAATATGTAACTCCTGGATCGTTTGATTTTCTTCATAAAC
AAAAATTCTGGGATCTTGTTGCAAAACGAGGAGATCAAGATTCCATTCCC
AAAGAATATGCTAAAAAATTAATTGCTATGCATCAAAAACGAGGAGATAA
AATTGTTTTTATAACAGGTAGGACAAGAGGGTCAATGTATAAGGAGGGCG
AGGTTGATAAAACAGCTAAAGCCTTAGCTAAAGATTTTAAATTAGACAAA
CCAATTGCTGTAAATTATACAGGCGATAAACCTAAAAAGCCATACAAATA
TGATAAATCATATTATATTAAGAAATATGGTTCAGACATTCATTATGGAG
ATAGTGATGACGATATTCATGCAGCTAGGGAGGCCGGTGCTAGACCAATT
AGAATTTTAAGAGCACCTAATTCTACAAATCTACCTTTACCAGAAGCTGG
AGGCTACGGTGAAGAGGTTCTCGAAAATTCAGCTTAC

SEQ ID NO 4004: SAG0653 FROM THE 18RS21 GBS TYPE II STRAIN
AAGGGGCCAAAAGTAGCTTATACACAAGA
GGGAATGACTGCTCTTTCGGACACAAATAAAGATAAAGTCACTACTATTT
CTATTGACGAGATTCAAAAAAGCTTAGAAGGTAAGAAGCCGATTACTGTT
AGTTTTGATATTGATGATACACTGCTTTTCAGTAGTCAATATTTTCAATA
TGGTAAAGAATATGTAACTCCTGGATCGTTTGATTTTCTTCATAAACAAA
AATTCTGGGATCTTGTTGCAAAACGAGGAGATCAAGATTCCATTCCCAAA
GAATATGCTAAAAAATTAATTGCTATGCATCAAAAACGAGGAGATAAAAT
TGTTTTTATAACAGGTAGGACAAGAGGGTCAATGTATAAGGAGGGCGAGG
TTGATAAAACAGCTAAAGCCTTAGCTAAAGATTTTAAATTAGACAAACCA
ATTGCTGTAAATTATACAGGCGATAAACCTAAAAAGCCATACAAATATGA
TAAATCATATTATATTAAGAAATATGGTTCAGACATTCATTATGGAGATA
GTGATGACGATATTCATGCAGCTAGGGAGGCCGGTGCTAGACCAATTAGA
ATTTTAAGAGCACCTAATTCTACAAATCTACCTTTACCAGAAGCTGGAGG
CTACGGTGAAGAGGTTCTCGAAAATTCAGCTTAC

SEQ ID NO 4005: SAG0653 FROM THE M732 GBS TYPE III STRAIN
AAGGGGCCAAAAGTAGCTTATACACAAGA
GGGAATGACTGCTCTTTCGGACACAAATAAAGATAAAGTCACTACTATTT
CTATTGACGAGATTCAAAAAAGCTTAGAAGGTAAGAAGCCGATTACTGTT
AGTTTTGATATTGATGATACACTGCTTTTCAGTAGTCAATATTTTCAATA
TGGTAAAGAATATGTAACTCCTGGATCGTTTGATTTTCTTCATAAACAAA
AATTCTGGGATCTTGTTGCAAAACGAGGAGATCAAGATTCCATTCCCAAA
GAATATGCTAAAAAATTAATTGCTATGCATCAAAAACGAGGAGATAAAAT
TGTTTTTATAACAGGTAGGACAAGAGGGTCAATGTATAAGGAGGGCGAGG
TTGATAAAACAGCTAAAGCCTTAGCTAAAGATTTTAAATTAGACAAACCA
ATTGCTGTAAATTATACAGGCGATAAACCTAAAAAGCCATACAAATATGA
TAAATCATATTATATTAAGAAATATGGTTCAGACATTCATTATGGAGATA
GTGATGACGATATTCATGCAGCTAGGGAGGCCGGTGCTAGACCAATTAGA
ATTTTAAGAGCACCTAATTCTACAAATCTACCTTTACCAGAAGCTGGAGG
CTACGGTGAAGAGGTTCTCGAAAATTCAGCTTAC

SEQ ID NO 4006: SAG0653 FROM THE COH1 GBS TYPE III STRAIN
AAGGGGCCAAAAGTAGCTTATACACAAGAGGGAATGACT
GCTCTTTCGGACACAAATAAAGATAAAGTCACTACTATTTCTATTGACGA
GATTCAAAAAAGCTTAGAAGGTAAGAAGCCGATTACTGTTAGTTTTGATA
TTGATGATACACTGCTTTTCAGTAGTCAATATTTTCAATATGGTAAAGAA
TATGTAACTCCTGGATCGTTTGATTTTCTTCATAAACAAAAATTCTGGGA
TCTTGTTGCAAAACGAGGAGATCAAGATTCCATTCCCAAAGAATATGCTA
AAAAATTAATTGCTATGCATCAAAAACGAGGAGATAAAATTGTTTTTATA
ACAGGTAGGACAAGAGGGTCAATGTATAAGGAGGGCGAGGTTGATAAAAC
AGCTAAAGCCTTAGCTAAAGATTTTAAATTAGACAAACCAATTGCTGTAA
ATTATACAGGCGATAAACCTAAAAAGCCATACAAATATGATAAATCATAT
TATATTAAGAAATATGGTTCAGACATTCATTATGGAGATAGTGATGACGA
TATTCATGCAGCTAGGGAGGCCGGTGCTAGACCAATTAGAATTTTAAGAG
CACCTAATTCTACAAATCTACCTTTACCAGAAGCTGGAGGCTACGGTGAA
GAGGTTCTCGAAAATTCAGCTTAC

SEQ ID NO 4007: SAG0653 FROM THE M781 GBS TYPE III STRAIN
AAGGGGCCAAAAGTAGCTTATACACA
AGAGGGAATGACTGCTCTTTCGGACACAAATAAAGATAAAGTCACTACTA
TTTCTATTGACGAGATTCAAAAAAGCTTAGAAGGTAAGAAGCCGATTACT
GTTAGTTTTGATATTGATGATACACTGCTTTTCAGTAGTCAATATTTTCA
ATATGGTAAAGAATATGTAACTCCTGGATCGTTTGATTTTCTTCATAAAC
AAAAATTCTGGGATCTTGTTGCAAAACGAGGAGATCAAGATTCCATTCCC
AAAGAATATGCTAAAAAATTAATTGCTATGCATCAAAAACGAGGAGATAA
AATTGTTTTTATAACAGGTAGGACAAGAGGGTCAATGTATAAGGAGGGCG
AGGTTGATAAAACAGCTAAAGCCTTAGCTAAAGATTTTAAATTAGACAAA
CCAATTGCTGTAAATTATACAGGCGATAAACCTAAAAAGCCATACAAATA
TGATAAATCATATTATATTAAGAAATATGGTTCAGACATTCATTATGGAG
ATAGTGATGACGATATTCATGCAGCTAGGGAGGCCGGTGCTAGACCAATT
AGAATTTTAAGAGCACCTAATTCTACAAATCTACCTTTACCAGAAGCTGG
AGGCTACGGTGAAGAGGTTCTCGAAAATTCAGCTTAC

SEQ ID NO 4008: SAG0653 FROM THE CJB110 GBS NONTYPEABLE STRAIN
AAGGGGCCAAAAGTAGCTTATACACAAGA
GGGAATGACTGCTCTTTCGGACACAAATAAAGATAAAGTCACTACTATTT
CTATTGACGAGATTCAAAAAAGCTTAGAAGGTAAGAAGCCGATTACTGTT
AGTTTTGATATTGATGATACACTGCTTTTCAGTAGTCAATATTTTCAATA
TGGTAAAGAATATGTAACTCCTGGATCGTTTGATTTTCTTCATAAACAAA
AATTCTGGGATCTTGTTGCAAAACGAGGAGATCAAGATTCCATTCCCAAA
GAATATGCTAAAAAATTAATTGCTATGCATCAAAAACGAGGAGATAAAAT
TGTTTTTATAACAGGTAGGACAAGAGGGTCAATGTATAAGGAGGGCGAGG
TTGATAAAACAGCTAAAGCCTTAGCTAAAGATTTTAAATTAGACAAACCA
ATTGCTGTAAATTATACAGGCGATAAACCTAAAAAGCCATACAAATATGA
TAAATCATATTATATTAAGAAATATGGTTCAGACATTCATTATGGAGATA
GTGATGACGATATTCATGCAGCTAGGGAGGCCGGTGCTAGACCAATTAGA
ATTTTAAGAGCACCTAATTCTACAAATCTACCTTTACCAGAAGCTGGAGG
CTACGGTGAAGAGGTTCTCGAAAATTCAGCTTAC

SEQ ID NO 4009: SAG0653 FROM THE JM9130013 GBS TYPE VIII STRAIN
AAGGGGCCAAAAGTAGCTTATACACAAGAGGGAAT
GACTGCTCTTTCGGACACAAATAAAGATAAAGTCACTACTATTTCTATTG
ACGAGATTCAAAAAAGCTTAGAAGGTAAGAAGCCGATTACTGTTAGTTTT
GATATTGATGATACACTGCTTTTCAGTAGTCAATATTTTCAATATGGTAA
AGAATATGTAACTCCTGGATCGTTTGATTTTCTTCATAAACAAAAATTCT
GGGATCTTGTTGCAAAACGAGGAGATCAAGATTCCATTCCCAAAGAATAT
GCTAAAAAATTAATTGCTATGCATCAAAAACGAGGAGATAAAATTGTTTT
TATAACAGGTAGGACAAGAGGGTCAATGTATAAGGAGGGCGAGGTTGATA
AAACAGCTAAAGCCTTAGCTAAAGATTTTAAATTAGACAAACCAATTGCT
GTAAATTATACAGGCGATAAACCTAAAAAGCCATACAAATATGATAAATC
ATATTATATTAAGAAATATGGTTCAGACATTCATTATGGAGATAGTGATG
ACGATATTCATGCAGCTAGGGAGGCCGGTGCTAGACCAATTAGAATTTTA
AGAGCACCTAATTCTACAAATCTACCTTTACCAGAAGCTGGAGGCTACGG
TGAAGAGGTTCTCGAAAATTCAGCTTAC
PRETTY of: /biottnp/msa20031.2{*} August 5, 2002 07:05

                           1                                                   50
msa20031.2{100_18RS21}     AAGGGGCCAA AAGTAGCTTA TACACAAGAG GGAATGACTG CTCTTTCGGA
msa20031.2{100_2603}       AAGGGGCCAA AAGTAGCTTA TACACAAGAG GGAATGACTG CTCTTTCGGA
msa20031.2{100_A909}       AAGGGGCCAA AAGTAGCTTA TACACAAGAG GGAATGACTG CTCTTTCGGA
msa20031.2{100_CJB110}     AAGGGGCCAA AAGTAGCTTA TACACAAGAG GGAATGACTG CTCTTTCGGA
msa20031.2{100_COH1}       AAGGGGCCAA AAGTAGCTTA TACACAAGAG GGAATGACTG CTCTTTCGGA
msa20031.2{100_JM9130013}  AAGGGGCCAA AAGTAGCTTA TACACAAGAG GGAATGACTG CTCTTTCGGA
msa20031.2{100_M732}       AAGGGGCCAA AAGTAGCTTA TACACAAGAG GGAATGACTG CTCTTTCGGA
msa20031.2{100_M781}       AAGGGGCCAA AAGTAGCTTA TACACAAGAG GGAATGACTG CTCTTTCGGA
msa20031.2{100_090}        AAGGGGCCAA AAGTAGCTTA TACACAAGAG GGAATGACTG CTCTTTCGGA
Consensus                  ********** ********** ********** ********** **********



                           51                                                 100
msa20031.2{100_18RS21}     CACAAATAAA GATAAAGTCA CTACTATTTC TATTGACGAG ATTCAAAAAA
msa20031.2{100_2603}       CACAAATAAA GATAAAGTCA CTACTATTTC TATTGACGAG ATTCAAAAAA
msa20031.2{100_A909}       CACAAATAAA GATAAAGTCA CTACTATTTC TATTGACGAG ATTCAAAAAA
msa20031.2{100_CJB110}     CACAAATAAA GATAAAGTCA CTACTATTTC TATTGACGAG ATTCAAAAAA
msa20031.2{100_COH1}       CACAAATAAA GATAAAGTCA CTACTATTTC TATTGACGAG ATTCAAAAAA
msa20031.2{100_JM9130013}  CACAAATAAA GATAAAGTCA CTACTATTTC TATTGACGAG ATTCAAAAAA
msa20031.2{100_M732}       CACAAATAAA GATAAAGTCA CTACTATTTC TATTGACGAG ATTCAAAAAA
msa20031.2{100_M781}       CACAAATAAA GATAAAGTCA CTACTATTTC TATTGACGAG ATTCAAAAAA
msa20031.2{100_090}        CACAAATAAA GATAAAGTCA CTACTATTTC TATTGACGAG ATTCAAAAAA
Consensus                  ********** ********** ********** ********** **********



                           101                                                150
msa20031.2{100_18RS21}     GCTTAGAAGG TAAGAAGCCG ATTACTGTTA GTTTTGATAT TGATGATACA
msa20031.2{100_2603}       GCTTAGAAGG TAAGAAGCCG ATTACTGTTA GTTTTGATAT TGATGATACA
msa20031.2{100_A909}       GCTTAGAAGG TAAGAAGCCG ATTACTGTTA GTTTTGATAT TGATGATACA
msa20031.2{100_CJB110}     GCTTAGAAGG TAAGAAGCCG ATTACTGTTA GTTTTGATAT TGATGATACA
msa20031.2{100_COH1}       GCTTAGAAGG TAAGAAGCCG ATTACTGTTA GTTTTGATAT TGATGATACA
msa20031.2{100_JM9130013}  GCTTAGAAGG TAAGAAGCCG ATTACTGTTA GTTTTGATAT TGATGATACA
msa20031.2{100_M732}       GCTTAGAAGG TAAGAAGCCG ATTACTGTTA GTTTTGATAT TGATGATACA
msa20031.2{100_M781}       GCTTAGAAGG TAAGAAGCCG ATTACTGTTA GTTTTGATAT TGATGATACA
msa20031.2{100_090}        GCTTAGAAGG TAAGAAGCCG ATTACTGTTA GTTTTGATAT TGATGATACA
Consensus                  ********** ********** ********** ********** **********



                           151                                                200
msa20031.2{100_18RS21}     CTgCTTTTCA GTAGTCAATA TTTTCAATAT GGTAAAGAAT ATGTAACTCC
msa20031.2{100_2603}       CTgCTTTTCA GTAGTCAATA TTTTCAATAT GGTAAAGAAT ATGTAACTCC
msa20031.2{100_A909}       CTgCTTTTCA GTAGTCAATA TTTTCAATAT GGTAAAGAAT ATGTAACTCC
msa20031.2{100_CJB110}     CTgCTTTTCA GTAGTCAATA TTTTCAATAT GGTAAAGAAT ATGTAACTCC
msa20031.2{100_COH1}       CTgCTTTTCA GTAGTCAATA TTTTCAATAT GGTAAAGAAT ATGTAACTCC
msa20031.2{100_JM9130013}  CTgCTTTTCA GTAGTCAATA TTTTCAATAT GGTAAAGAAT ATGTAACTCC
msa20031.2{100_M732}       CTgCTTTTCA GTAGTCAATA TTTTCAATAT GGTAAAGAAT ATGTAACTCC
msa20031.2{100_M781}       CTgCTTTTCA GTAGTCAATA TTTTCAATAT GGTAAAGAAT ATGTAACTCC
msa20031.2{100_090}        CTaCTTTTCA GTAGTCAATA TTTTCAATAT GGTAAAGAAT ATGTAACTCC
Consensus                  **_******* ********** ********** ********** **********



                           201                                                250
msa20031.2{100_18RS21}     TGGATCGTTT GATTTTCTTC ATAAACAAAA ATTCTGGGAT CTTGTTGCAA
msa20031.2{100_2603}       TGGATCGTTT GATTTTCTTC ATAAACAAAA ATTCTGGGAT CTTGTTGCAA
msa20031.2{100_A909}       TGGATCGTTT GATTTTCTTC ATAAACAAAA ATTCTGGGAT CTTGTTGCAA
msa20031.2{100_CJB110}     TGGATCGTTT GATTTTCTTC ATAAACAAAA ATTCTGGGAT CTTGTTGCAA
msa20031.2{100_COH1}       TGGATCGTTT GATTTTCTTC ATAAACAAAA ATTCTGGGAT CTTGTTGCAA
msa20031.2{100_JM9130013}  TGGATCGTTT GATTTTCTTC ATAAACAAAA ATTCTGGGAT CTTGTTGCAA
msa20031.2{100_M732}       TGGATCGTTT GATTTTCTTC ATAAACAAAA ATTCTGGGAT CTTGTTGCAA
msa20031.2{100_M781}       TGGATCGTTT GATTTTCTTC ATAAACAAAA ATTCTGGGAT CTTGTTGCAA
msa20031.2{100_090}        TGGATCGTTT GATTTTCTTC ATAAACAAAA ATTCTGGGAT CTTGTTGCAA
Consensus                  ********** ********** ********** ********** **********

                           251                                                300
msa20031.2{100_18RS21}     AACGAGGAGA TCAAGATTCC ATTCCCAAAG AATATGCTAA AAAATTAATT
msa20031.2{100_2603}       AACGAGGAGA TCAAGATTCC ATTCCCAAAG AATATGCTAA AAAATTAATT
msa20031.2{100_A909}       AACGAGGAGA TCAAGATTCC ATTCCCAAAG AATATGCTAA AAAATTAATT
msa20031.2{100_CJB110}     AACGAGGAGA TCAAGATTCC ATTCCCAAAG AATATGCTAA AAAATTAATT
msa20031.2{100_COH1}       AACGAGGAGA TCAAGATTCC ATTCCCAAAG AATATGCTAA AAAATTAATT
msa20031.2{100_JM9130013}  AACGAGGAGA TCAAGATTCC ATTCCCAAAG AATATGCTAA AAAATTAATT
msa20031.2{100_M732}       AACGAGGAGA TCAAGATTCC ATTCCCAAAG AATATGCTAA AAAATTAATT
msa20031.2{100_M781}       AACGAGGAGA TCAAGATTCC ATTCCCAAAG AATATGCTAA AAAATTAATT
msa20031.2{100_090}        AACGAGGAGA TCAAGATTCC ATTCCCAAAG AATATGCTAA AAAATTAATT
Consensus                  ********** ********** ********** ********** **********



301 350

msa20031.2{100_18RS21} GCTATGCATC AAAAACGAGG AGATAAAATT GTTTTTATAA CAGGTAGGAC

msa20031.2{100_2603} GCTATGCATC AAAAACGAGG AGATAAAATT GTTTTTATAA CAGGTAGGAC

msa20031.2{100_A909} GCTATGCATC AAAAACGAGG AGATAAAATT GTTTTTATAA CAGGTAGGAC

msa20031.2{100_CJB110} GCTATGCATC AAAAACGAGG AGATAAAATT GTTTTTATAA CAGGTAGGAC

msa20031.2{100_COH1} GCTATGCATC AAAAACGAGG AGATAAAATT GTTTTTATAA CAGGTAGGAC

msa20031.2{100_JM9130013} GCTATGCATC AAAAACGAGG AGATAAAATT GTTTTTATAA CAGGTAGGAC

msa20031.2{100_M732} GCTATGCATC AAAAACGAGG AGATAAAATT GTTTTTATAA CAGGTAGGAC

msa20031.2{100_M781} GCTATGCATC AAAAACGAGG AGATAAAATT GTTTTTATAA CAGGTAGGAC






WO 2004/018646 



Table 40: Comparative Sequences relating to SAG0635 



msa20031.2{100_090} GCTATGCATC AAAAACGAGG AGATAAAATT GTTTTTATAA CAGGTAGGAC

Consensus ********** ********** ********** ********** **********



351 400

msa20031.2{100_18RS21} AAGAGGGTCA ATGTATAAGG AGGGCGAGGT TGATAAAACA GCTAAAGCCT

msa20031.2{100_2603} AAGAGGGTCA ATGTATAAGG AGGGCGAGGT TGATAAAACA GCTAAAGCCT

msa20031.2{100_A909} AAGAGGGTCA ATGTATAAGG AGGGCGAGGT TGATAAAACA GCTAAAGCCT

msa20031.2{100_CJB110} AAGAGGGTCA ATGTATAAGG AGGGCGAGGT TGATAAAACA GCTAAAGCCT

msa20031.2{100_COH1} AAGAGGGTCA ATGTATAAGG AGGGCGAGGT TGATAAAACA GCTAAAGCCT

msa20031.2{100_JM9130013} AAGAGGGTCA ATGTATAAGG AGGGCGAGGT TGATAAAACA GCTAAAGCCT

msa20031.2{100_M732} AAGAGGGTCA ATGTATAAGG AGGGCGAGGT TGATAAAACA GCTAAAGCCT

msa20031.2{100_M781} AAGAGGGTCA ATGTATAAGG AGGGCGAGGT TGATAAAACA GCTAAAGCCT

msa20031.2{100_090} AAGAGGGTCA ATGTATAAGG AGGGCGAGGT TGATAAAACA GCTAAAGCCT

Consensus ********** ********** ********** ********** **********



401 450

msa20031.2{100_18RS21} TAGCTAAAGA TTTTAAATTA GACAAACCAA TTGCTGTAAA TTATACAGGC

msa20031.2{100_2603} TAGCTAAAGA TTTTAAATTA GACAAACCAA TTGCTGTAAA TTATACAGGC

msa20031.2{100_A909} TAGCTAAAGA TTTTAAATTA GACAAACCAA TTGCTGTAAA TTATACAGGC

msa20031.2{100_CJB110} TAGCTAAAGA TTTTAAATTA GACAAACCAA TTGCTGTAAA TTATACAGGC

msa20031.2{100_COH1} TAGCTAAAGA TTTTAAATTA GACAAACCAA TTGCTGTAAA TTATACAGGC

msa20031.2{100_JM9130013} TAGCTAAAGA TTTTAAATTA GACAAACCAA TTGCTGTAAA TTATACAGGC

msa20031.2{100_M732} TAGCTAAAGA TTTTAAATTA GACAAACCAA TTGCTGTAAA TTATACAGGC

msa20031.2{100_M781} TAGCTAAAGA TTTTAAATTA GACAAACCAA TTGCTGTAAA TTATACAGGC

msa20031.2{100_090} TAGCTAAAGA TTTTAAATTA GACAAACCAA TTGCTGTAAA TTATACAGGC

Consensus ********** ********** ********** ********** **********



451 500

msa20031.2{100_18RS21} GATAAACCTA AAAAGCCATA CAAATATGAT AAATCATATT ATATTAAGAA

msa20031.2{100_2603} GATAAACCTA AAAAGCCATA CAAATATGAT AAATCATATT ATATTAAGAA

msa20031.2{100_A909} GATAAACCTA AAAAGCCATA CAAATATGAT AAATCATATT ATATTAAGAA

msa20031.2{100_CJB110} GATAAACCTA AAAAGCCATA CAAATATGAT AAATCATATT ATATTAAGAA

msa20031.2{100_COH1} GATAAACCTA AAAAGCCATA CAAATATGAT AAATCATATT ATATTAAGAA

msa20031.2{100_JM9130013} GATAAACCTA AAAAGCCATA CAAATATGAT AAATCATATT ATATTAAGAA

msa20031.2{100_M732} GATAAACCTA AAAAGCCATA CAAATATGAT AAATCATATT ATATTAAGAA

msa20031.2{100_M781} GATAAACCTA AAAAGCCATA CAAATATGAT AAATCATATT ATATTAAGAA

msa20031.2{100_090} GATAAACCTA AAAAGCCATA CAAATATGAT AAATCATATT ATATTAAGAA

Consensus ********** ********** ********** ********** **********



501 550

msa20031.2{100_18RS21} ATATGGTTCA GACATTCATT ATGGAGATAG TGATGACGAT ATTCATGCAG

msa20031.2{100_2603} ATATGGTTCA GACATTCATT ATGGAGATAG TGATGACGAT ATTCATGCAG

msa20031.2{100_A909} ATATGGTTCA GACATTCATT ATGGAGATAG TGATGACGAT ATTCATGCAG

msa20031.2{100_CJB110} ATATGGTTCA GACATTCATT ATGGAGATAG TGATGACGAT ATTCATGCAG

msa20031.2{100_COH1} ATATGGTTCA GACATTCATT ATGGAGATAG TGATGACGAT ATTCATGCAG

msa20031.2{100_JM9130013} ATATGGTTCA GACATTCATT ATGGAGATAG TGATGACGAT ATTCATGCAG

msa20031.2{100_M732} ATATGGTTCA GACATTCATT ATGGAGATAG TGATGACGAT ATTCATGCAG

msa20031.2{100_M781} ATATGGTTCA GACATTCATT ATGGAGATAG TGATGACGAT ATTCATGCAG

msa20031.2{100_090} ATATGGTTCA GACATTCATT ATGGAGATAG TGATGACGAT ATTCATGCAG

Consensus ********** ********** ********** ********** **********



551 600 

msa20031.2{100_18RS21} CTAGGGAGGC CGGTGCTAGA CCAATTAGAA TTTTAAGAGC ACCTAATTCT

msa20031.2{100_2603} CTAGGGAGGC CGGTGCTAGA CCAATTAGAA TTTTAAGAGC ACCTAATTCT

msa20031.2{100_A909} CTAGGGAGGC CGGTGCTAGA CCAATTAGAA TTTTAAGAGC ACCTAATTCT

msa20031.2{100_CJB110} CTAGGGAGGC CGGTGCTAGA CCAATTAGAA TTTTAAGAGC ACCTAATTCT

msa20031.2{100_COH1} CTAGGGAGGC CGGTGCTAGA CCAATTAGAA TTTTAAGAGC ACCTAATTCT

msa20031.2{100_JM9130013} CTAGGGAGGC CGGTGCTAGA CCAATTAGAA TTTTAAGAGC ACCTAATTCT

msa20031.2{100_M732} CTAGGGAGGC CGGTGCTAGA CCAATTAGAA TTTTAAGAGC ACCTAATTCT

msa20031.2{100_M781} CTAGGGAGGC CGGTGCTAGA CCAATTAGAA TTTTAAGAGC ACCTAATTCT

msa20031.2{100_090} CTAGGGAGGC CGGTGCTAGA CCAATTAGAA TTTTAAGAGC ACCTAATTCT

Consensus ********** ********** ********** ********** ********** 



601 650

msa20031.2{100_18RS21} ACAAATCTAC CTTTACCAGA AGCTGGAGGC TACGGTGAAG AGGTTCTCGA

msa20031.2{100_2603} ACAAATCTAC CTTTACCAGA AGCTGGAGGC TACGGTGAAG AGGTTCTCGA

msa20031.2{100_A909} ACAAATCTAC CTTTACCAGA AGCTGGAGGC TACGGTGAAG AGGTTCTCGA

msa20031.2{100_CJB110} ACAAATCTAC CTTTACCAGA AGCTGGAGGC TACGGTGAAG AGGTTCTCGA

msa20031.2{100_COH1} ACAAATCTAC CTTTACCAGA AGCTGGAGGC TACGGTGAAG AGGTTCTCGA

msa20031.2{100_JM9130013} ACAAATCTAC CTTTACCAGA AGCTGGAGGC TACGGTGAAG AGGTTCTCGA

msa20031.2{100_M732} ACAAATCTAC CTTTACCAGA AGCTGGAGGC TACGGTGAAG AGGTTCTCGA

msa20031.2{100_M781} ACAAATCTAC CTTTACCAGA AGCTGGAGGC TACGGTGAAG AGGTTCTCGA

msa20031.2{100_090} ACAAATCTAC CTTTACCAGA AGCTGGAGGC TACGGTGAAG AGGTTCTCGA

Consensus ********** ********** ********** ********** **********



651 663 

msa20031.2{100_18RS21} AAATTCAGCT TAC

msa20031.2{100_2603} AAATTCAGCT TAC

msa20031.2{100_A909} AAATTCAGCT TAC

msa20031.2{100_CJB110} AAATTCAGCT TAC

msa20031.2{100_COH1} AAATTCAGCT TAC

msa20031.2{100_JM9130013} AAATTCAGCT TAC

msa20031.2{100_M732} AAATTCAGCT TAC

msa20031.2{100_M781} AAATTCAGCT TAC

msa20031.2{100_090} AAATTCAGCT TAC

Consensus ********** ***

SEQ ID NO 4010: SAG0653 FROM THE 2603 V/R GBS TYPE V STRAIN

KGPKVAYTQEGMTALSDTNKDKVTTISIDEIQKSLEGKKPITVSFDIDDTLLFSSQYFQY
GKEYVTPGSFDFLHKQKFWDLVAKRGDQDSIPKEYAKKLIAMHQKRGDKIVFITGRTRGS
MYKEGEVDKTAKALAKDFKLDKPIAVNYTGDKPKKPYKYDKSYYIKKYGSDIHYGDSDDD
IHAAREAGARPIRILRAPNSTNLPLPEAGGYGEEVLENSAY

SEQ ID NO 4011: SAG0653 FROM THE 090 GBS TYPE III STRAIN

KGPKVAYTQEGMTALSDTNKDKVTTISIDEIQKSLEGKKPITVSFDIDDTLLFSSQYFQY
GKEYVTPGSFDFLHKQKFWDLVAKRGDQDSIPKEYAKKLIAMHQKRGDKIVFITGRTRGS
MYKEGEVDKTAKALAKDFKLDKPIAVNYTGDKPKKPYKYDKSYYIKKYGSDIHYGDSDDD
IHAAREAGARPIRILRAPNSTNLPLPEAGGYGEEVLENSAY

SEQ ID NO 4012: SAG0653 FROM THE A909 GBS TYPE Ia STRAIN

KGPKVAYTQEGMTALSDTNKDKVTTISIDEIQKSLEGKKPITVSFDIDDTLLFSSQYFQY
GKEYVTPGSFDFLHKQKFWDLVAKRGDQDSIPKEYAKKLIAMHQKRGDKIVFITGRTRGS
MYKEGEVDKTAKALAKDFKLDKPIAVNYTGDKPKKPYKYDKSYYIKKYGSDIHYGDSDDD
IHAAREAGARPIRILRAPNSTNLPLPEAGGYGEEVLENSAY

SEQ ID NO 4013: SAG0653 FROM THE 18RS21 GBS TYPE II STRAIN

KGPKVAYTQEGMTALSDTNKDKVTTISIDEIQKSLEGKKPITVSFDIDDTLLFSSQYFQY
GKEYVTPGSFDFLHKQKFWDLVAKRGDQDSIPKEYAKKLIAMHQKRGDKIVFITGRTRGS
MYKEGEVDKTAKALAKDFKLDKPIAVNYTGDKPKKPYKYDKSYYIKKYGSDIHYGDSDDD
IHAAREAGARPIRILRAPNSTNLPLPEAGGYGEEVLENSAY

SEQ ID NO 4014: SAG0653 FROM THE COH1 GBS TYPE III STRAIN

KGPKVAYTQEGMTALSDTNKDKVTTISIDEIQKSLEGKKPITVSFDIDDTLLFSSQYFQY
GKEYVTPGSFDFLHKQKFWDLVAKRGDQDSIPKEYAKKLIAMHQKRGDKIVFITGRTRGS
MYKEGEVDKTAKALAKDFKLDKPIAVNYTGDKPKKPYKYDKSYYIKKYGSDIHYGDSDDD
IHAAREAGARPIRILRAPNSTNLPLPEAGGYGEEVLENSAY

SEQ ID NO 4015: SAG0653 FROM THE M781 GBS TYPE III STRAIN

KGPKVAYTQEGMTALSDTNKDKVTTISIDEIQKSLEGKKPITVSFDIDDTLLFSSQYFQY
GKEYVTPGSFDFLHKQKFWDLVAKRGDQDSIPKEYAKKLIAMHQKRGDKIVFITGRTRGS
MYKEGEVDKTAKALAKDFKLDKPIAVNYTGDKPKKPYKYDKSYYIKKYGSDIHYGDSDDD
IHAAREAGARPIRILRAPNSTNLPLPEAGGYGEEVLENSAY

SEQ ID NO 4016: SAG0653 FROM THE CJB110 GBS NONTYPEABLE STRAIN

KGPKVAYTQEGMTALSDTNKDKVTTISIDEIQKSLEGKKPITVSFDIDDTLLFSSQYFQY
GKEYVTPGSFDFLHKQKFWDLVAKRGDQDSIPKEYAKKLIAMHQKRGDKIVFITGRTRGS
MYKEGEVDKTAKALAKDFKLDKPIAVNYTGDKPKKPYKYDKSYYIKKYGSDIHYGDSDDD
IHAAREAGARPIRILRAPNSTNLPLPEAGGYGEEVLENSAY

SEQ ID NO 4017: SAG0653 FROM THE JM9130013 GBS TYPE VIII STRAIN

KGPKVAYTQEGMTALSDTNKDKVTTISIDEIQKSLEGKKPITVSFDIDDTLLFSSQYFQY
GKEYVTPGSFDFLHKQKFWDLVAKRGDQDSIPKEYAKKLIAMHQKRGDKIVFITGRTRGS
MYKEGEVDKTAKALAKDFKLDKPIAVNYTGDKPKKPYKYDKSYYIKKYGSDIHYGDSDDD
IHAAREAGARPIRILRAPNSTNLPLPEAGGYGEEVLENSAY

SEQ ID NO 4018 : SAG0653 FROM THE M732 GBS TYPE III STRAIN 

KGPKVAYTQEGMTALSDTNKDKVTTISIDEIQKSLEGKKPITVSFDIDDTLLFSSQYFQY
GKEYVTPGSFDFLHKQKFWDLVAKRGDQDSIPKEYAKKLIAMHQKRGDKIVFITGRTRGS
MYKEGEVDKTAKALAKDFKLDKPIAVNYTGDKPKKPYKYDKSYYIKKYGSDIHYGDSDDD
IHAAREAGARPIRILRAPNSTNLPLPEAGGYGEEVLENSAY
PRETTY of: /biotmp/msa25122.2{*} August 5, 2002 07:09



1 50

msa25122.2{100_090} KGPKVAYTQE GMTALSDTNK DKVTTISIDE IQKSLEGKKP ITVSFDIDDT

msa25122.2{100_18RS21} KGPKVAYTQE GMTALSDTNK DKVTTISIDE IQKSLEGKKP ITVSFDIDDT

msa25122.2{100_2603} KGPKVAYTQE GMTALSDTNK DKVTTISIDE IQKSLEGKKP ITVSFDIDDT

msa25122.2{100_A909} KGPKVAYTQE GMTALSDTNK DKVTTISIDE IQKSLEGKKP ITVSFDIDDT

msa25122.2{100_CJB110} KGPKVAYTQE GMTALSDTNK DKVTTISIDE IQKSLEGKKP ITVSFDIDDT

msa25122.2{100_COH1} KGPKVAYTQE GMTALSDTNK DKVTTISIDE IQKSLEGKKP ITVSFDIDDT

msa25122.2{100_JM9130013} KGPKVAYTQE GMTALSDTNK DKVTTISIDE IQKSLEGKKP ITVSFDIDDT

msa25122.2{100_M732} KGPKVAYTQE GMTALSDTNK DKVTTISIDE IQKSLEGKKP ITVSFDIDDT

msa25122.2{100_M781} KGPKVAYTQE GMTALSDTNK DKVTTISIDE IQKSLEGKKP ITVSFDIDDT

Consensus ********** ********** ********** ********** **********



51 100

msa25122.2{100_090} LLFSSQYFQY GKEYVTPGSF DFLHKQKFWD LVAKRGDQDS IPKEYAKKLI

msa25122.2{100_18RS21} LLFSSQYFQY GKEYVTPGSF DFLHKQKFWD LVAKRGDQDS IPKEYAKKLI

msa25122.2{100_2603} LLFSSQYFQY GKEYVTPGSF DFLHKQKFWD LVAKRGDQDS IPKEYAKKLI

msa25122.2{100_A909} LLFSSQYFQY GKEYVTPGSF DFLHKQKFWD LVAKRGDQDS IPKEYAKKLI

msa25122.2{100_CJB110} LLFSSQYFQY GKEYVTPGSF DFLHKQKFWD LVAKRGDQDS IPKEYAKKLI

msa25122.2{100_COH1} LLFSSQYFQY GKEYVTPGSF DFLHKQKFWD LVAKRGDQDS IPKEYAKKLI

msa25122.2{100_JM9130013} LLFSSQYFQY GKEYVTPGSF DFLHKQKFWD LVAKRGDQDS IPKEYAKKLI

msa25122.2{100_M732} LLFSSQYFQY GKEYVTPGSF DFLHKQKFWD LVAKRGDQDS IPKEYAKKLI

msa25122.2{100_M781} LLFSSQYFQY GKEYVTPGSF DFLHKQKFWD LVAKRGDQDS IPKEYAKKLI

Consensus ********** ********** ********** ********** **********



101 150

msa25122.2{100_090} AMHQKRGDKI VFITGRTRGS MYKEGEVDKT AKALAKDFKL DKPIAVNYTG

msa25122.2{100_18RS21} AMHQKRGDKI VFITGRTRGS MYKEGEVDKT AKALAKDFKL DKPIAVNYTG

msa25122.2{100_2603} AMHQKRGDKI VFITGRTRGS MYKEGEVDKT AKALAKDFKL DKPIAVNYTG

msa25122.2{100_A909} AMHQKRGDKI VFITGRTRGS MYKEGEVDKT AKALAKDFKL DKPIAVNYTG

msa25122.2{100_CJB110} AMHQKRGDKI VFITGRTRGS MYKEGEVDKT AKALAKDFKL DKPIAVNYTG

msa25122.2{100_COH1} AMHQKRGDKI VFITGRTRGS MYKEGEVDKT AKALAKDFKL DKPIAVNYTG

msa25122.2{100_JM9130013} AMHQKRGDKI VFITGRTRGS MYKEGEVDKT AKALAKDFKL DKPIAVNYTG

msa25122.2{100_M732} AMHQKRGDKI VFITGRTRGS MYKEGEVDKT AKALAKDFKL DKPIAVNYTG

msa25122.2{100_M781} AMHQKRGDKI VFITGRTRGS MYKEGEVDKT AKALAKDFKL DKPIAVNYTG

Consensus ********** ********** ********** ********** **********



151 200

msa25122.2{100_090} DKPKKPYKYD KSYYIKKYGS DIHYGDSDDD IHAAREAGAR PIRILRAPNS

msa25122.2{100_18RS21} DKPKKPYKYD KSYYIKKYGS DIHYGDSDDD IHAAREAGAR PIRILRAPNS

msa25122.2{100_2603} DKPKKPYKYD KSYYIKKYGS DIHYGDSDDD IHAAREAGAR PIRILRAPNS

msa25122.2{100_A909} DKPKKPYKYD KSYYIKKYGS DIHYGDSDDD IHAAREAGAR PIRILRAPNS

msa25122.2{100_CJB110} DKPKKPYKYD KSYYIKKYGS DIHYGDSDDD IHAAREAGAR PIRILRAPNS

msa25122.2{100_COH1} DKPKKPYKYD KSYYIKKYGS DIHYGDSDDD IHAAREAGAR PIRILRAPNS

msa25122.2{100_JM9130013} DKPKKPYKYD KSYYIKKYGS DIHYGDSDDD IHAAREAGAR PIRILRAPNS

msa25122.2{100_M732} DKPKKPYKYD KSYYIKKYGS DIHYGDSDDD IHAAREAGAR PIRILRAPNS

msa25122.2{100_M781} DKPKKPYKYD KSYYIKKYGS DIHYGDSDDD IHAAREAGAR PIRILRAPNS

Consensus ********** ********** ********** ********** **********



201 221 

msa25122.2{100_090} TNLPLPEAGG YGEEVLENSA Y

msa25122.2{100_18RS21} TNLPLPEAGG YGEEVLENSA Y

msa25122.2{100_2603} TNLPLPEAGG YGEEVLENSA Y

msa25122.2{100_A909} TNLPLPEAGG YGEEVLENSA Y

msa25122.2{100_CJB110} TNLPLPEAGG YGEEVLENSA Y

msa25122.2{100_COH1} TNLPLPEAGG YGEEVLENSA Y

msa25122.2{100_JM9130013} TNLPLPEAGG YGEEVLENSA Y

msa25122.2{100_M732} TNLPLPEAGG YGEEVLENSA Y

msa25122.2{100_M781} TNLPLPEAGG YGEEVLENSA Y

Consensus ********** ********** * 
SEQ ID NO. 4101: SAG0649 FROM 2603 V/R GBS TYPE V STRAIN

ATGAAAAAGAGACAAAAAATA 

TGGAGAGGGTTATCAGTTACTTTACTAATCCTGTCCCAAATTCCATTT 
CAAGGTGA^ACCCAAGATACCAATCAAGCACTTGGAAAAGTAATTGTTAAZ^AAACGGGA 
GACAATG CT ACACCATTAGG CAAAG CG ACTTTTGTGTTAAAAAATGACAATG ATAAGT CA 
GAAACAAGTCACGAAACGGTAGAGGGTTCTGGAGAAGCAACCTTTGAAAACATAAAACCT
GGAGACTACACATTAAGAGAAGAAACAGCACCAATTGGTTATAAAAAAACTGATAAAACC 
TGGAAAGTTAAAGTTGCAGATAACGGAGCAACAATA^TCGAGGGTATGGATGCAGATAAA 
G CAGAG AAACGAAAAGAAGTTTTG AATG C CCAAT ATC CAAAATCAGCT ATTT ATGAG GAT 
ACAAAAGAAAATTACCCATTAGTTAATGTAGAGGGTTCCAAAGTTGGTGAACAATACAAA 
GCATT GAAT C CAATAAATGG AAAAG ATGGTCG AAG AG AG ATTGCTGAAGGTTGGTTAT CA 
AAAAAAATTACAGGGGTCAATGATCTCGATAAGAATAAATATAAAATTGAATTAACTGTT 
GAGGGTAAAACGRCTGTTGAAACGAAAGAACTTAATCAA.CCACTAGATGTCGTTGTGCTA 
TTAG ATAATT CAAAT AGTATGAAT AATGAAAG AG C CAAT AATTCT CAAAG AG CATTAAAA 
G CTGGGGAAG CAGTTGAAAAG CTG ATTGATAAAATTACAT CAAAT AAAGACAATAGAGTA 
GCTCTTGTGACATATGCCTCAACCATTTTT 

GTTG CCG AT CAAAATGGTAAAG CG CTG AATGATAGTGTAT CATGGGATTATCATAAAACT 
ACTTTTACAG CAACTACACATAATTACAGTTATTTAAATT TAACAAATG ATG CTAACGAA 
GTTAATATTCTAAAGTCAAGAATT CCAAAGGAAG CGG AG CAT ATAAATGGGG AT CG CACG 
CTCTATCAATTTGGTGCGACATTTACTCAAAAAGCTCT^^ 
GAGACACAAAGTTCTAATGCTAGAAAAAAACTTATTT^ 

ACGATGT CITATGCCATAAATTTTAAT CCTTATATAT CAACA.T CITACCAAAACCAGTTT 

AATTCTTTTTTAAATAAAATAC CAG AT AGAAGTGGTATT CTC CAAGAGG ATTTTATAAT C 

AATGGTGATG AT TAT CAAAT AG TAAAAGGAGATGG AGAGAGTTTT AAACTGTTTT CGGAT 

AGAAAAGTT C CTGTTACTGG AGGAACG ACACAAG CAGCTTAT CGAGTACCGCAAAAT CAA 

CTCTCTGTAATGAGTAATGAGGGATATGCAATTAATAGTGGATATATTTATCTCTATTGG 

AGAGATTACAACTGG GT CTAT C CATTTGATCCTAAGACAAAG AAAGTTTCTG CAACa^\AA 

CAAAT CAAAACTCATGGTG AG C CAACAACATT ATACITTAATGG AAATAT AAGAC CTAAA 

GGTTATGACATTTTTACTGTTGGGATTGGTGTAAACGGAGATCCTGGTG 

GAAG CTG AGAAATTTATGCAAT CAATATCAAGTAAAACAG AAAATTATACTAATGTTGAT 

GATA CAAAT AAAATTTATGATG AG CTAAATAAATACTTTAAAACAATTGTTG AG G AAAAA 

CATTCTATTGTTGATGGAAATGTGACTGATCCTATGGGAGAGATGATTGA^ 

AAAAATGGT CAAAGTTTTACACATG ATGATTACGTTTTGGTTGGAAA CAA 

TTAAAAAATGGTGTGGCTCTTGGTGGACCAAACAGTGATGGGGGAATTTTAAAAGATGTT 

ACAGTGACTTATGATAAGACAT CT CAAACCAT CAAAATCAAT CATTTGAACTTAGGAAGT 

GGACAAAAAGTAGTTCTTACCTATGATGTA03TTTAAAAGAT^ 

TTTT ACAATACAAATAAT CGTACAACG CTAAGTC CGAAG AGTGAAAAAG AAC CAAATACT 
ATTCGTGATTTCCCAATTCCCAAAATTCGTGATGTTCGTGAGTrrCCG^TACTAACCaTC 
AGTAATCAGAAGAAAATGGGTGAGGTTGAATTTATTAAAGTTAA 

GAATCG CTTTTGGG AG CTAAGTTT CAACT T CAGAT AG AAAAAGATTTTT CTGGGTAT AAG 
CAA.TTTGTTCCAGAGGGAAGTGATGTTACAACAAAGAATGATGGTAAAATT^ 
GCACTTC^GATGGTAACTATAAATTATATGAAATTTCAAGTCCAGATGGCTATATAGAG 
GTTAAAA.03AAACCTGTTGTGACATTTACAATTCAAA^TGGAGAAGTTACC^A.CCTGAAA 
GCAGATCCAAATGCTAATAAAAATCAAATCGGGTATCTTGAAGGAAATGGTAAACATCTT 
ATT AC CAACACT C CCAAACG CCCAC CAGGTGTTTTTC CTAAAACAGGGGG AATTGGTACA 
ATTGT CTAT ATATTAGTTG GTTCTACTTTTATGAT ACTTACCATTTGTT CTTTC CGT CGT 
AAACAATTG 

SEQ ID NO. 4102: SAG0649 FROM 090 GBS TYPE Ia STRAIN

GGTGAAACCCAAGATACCAATCAAGCACTTGG AAAAG 

TAATTGTTAAAAAAACGGG AGACAATG CTACAC CATTAGG CAAAG CG ACT 

TTTGTGTTAAAAAATGACAATGATAAGTCAGAAACAAGT CACGAAACGGT 

AGAGK3GTTCTGGAGAAGCAACCTTTGAAAACATAAAACCTGGAGACTACA 

CATTAAGAGAAGAAACAGCACCAATTGGTTATAAAAAAA.CTGATAAAACC 

TGGAAAGTTAAAGTTGCAGATAACGGAGCAACAATAATCGAGGGTATGGA 

TGCAGATAAAGCAGAGAAACGAAAAGAAGTTTTGAATGCCCAATATCCAA 

AATCAGCTATTTATGAGGATACAAAAGAAAATTACCCATTAGTTAATGTA 

GAGGGTT CGAAAGTTGGTG AACAAT ACAAAG CATTGAATC CAATAAATGG 

AAAAGATGGTCGAAGAGAGATTGCTGAAGGTTGGTTATCAAAAAAAATTA 

CAGGK3GTCAATGATCTCGATAAGAATAAATATAAAATTGAATTAACTGTT 

GAGGGTAAAACCACTGTTGAAACGAAAGAACTTAATCAACCACTAGATGT 

CGTTGTGCTATTAGATAATTCAAATAGTATGAATAATGAAAGAGCCAATA 

ATTCTCAAAGAGCATTAAAAGCTGGGGAAGCAGTTGAAAAGCTGATTGAT 

AAAATTACATCAAATAAAGACAATAGAGTAGCTCTTGTGACATATGCCTC 

AACC^TTTTTGATGGTACTGAAGCGACCGTATCAAAGGGAGTTGCCGATC 

AAAATGGTAAAGCGCTGAATGATAGTGTATCATGGGATTATCATAAAACT 

ACTTTTACAGCAACTACACATAATTACAGTTATTTAA^TTTAACAAATGA 

TG CTAACGAAGTTAATATT CTAAAGTCAAG AATT C CAAAGG AAG CGG AG C 

ATATAAATGGGG AT CG C ACG CT CTATC AATTTGGTGCGACATTTACT CAA 

AAAG CT CTAATG AAAG CAAATG AAATTTT AGAGACACAAAGTTCT AATG C 

TAGAAAAAAACTTATTTTTCACGTAACTGATGGTGTCCCTACGATGTCTT 

ATG CCATAAATTTT AAT CCTT ATATATCAACA.TCT T ACCAAAACCAGTTT 

AATT CTTTT TTAAAT AAAAT AC CAG AT AG AAGTG GT ATTCT C CAAGAGG A 

TTTTATAATCAATGGTGATGATTATCAAATAGTAAAAGGAGATGGAGAGA 

GTTTTAAACTGTTTTCGGATAGAAAAGTTCCTGTTACTGGAGGAACGACA 

CAAGCAGCTTATCGAGTACCGCAAAATCAACTCTCTGTAATGAGTAATGA 

GGGATATGCAATTAATAGTGGATATATTTaTCTCTATTGGAGAGATTACA 

ACTGGGTCTATCCATTTGATCCTAACrACAA^GAAAGTTTCTGCAACGAAA 

CAAAT CAAAACT CATGGTGAGCCAACAACATTATACTTTAATGGAAATAT 

AAGACCTAAAGGTTATGACATTTTTACTGTTGGGATTGGTGTAAACGGAG 

AT C CTGGTG CAACTC CTCTTGAAG CTG AG AAATTT ATG CAAT CAATATCA 

AGTAAAACAGAAAATTATACTAATGTTGATGATACAAATAAAATTTATGA 

TGAG CTAAATAAATACTTT AAAACAATTGT TG AGG AAAAACATT CTATTG 









Table 41: Comparative Sequences relating to SAG0649 



TTGATGG AAATGTG ACTGAT CCTATGGGAG AGATGATTG AATT CC^TTA 

AAAAATGGT CAAAGTTTTACACATGATGATTACG t TTTGG tTGGAAATGA 

tGGCAGTCPATTAAAAAATGGTGTGGCTCTTGGTGGACCAAACAGTGATG 

GGGGAATTTTAAAAGATGTTACAGTGACTTATGATAAGACATCTCAAACC 

ATCAAAATCAATCATTTGAACTTAGGAAGTGGAGAAAAAGTAGTTCTTAC 

CTATGATGTAa3TTTAAAAGATAACTATATAAGTAACAAATTTTACAATA 

CAAATAATCGTACAACGCTAAGTC CGAAGAGTGAAAAAGAAC CAAAT ACT 

ATTCGTGATTTCCCAATTCCCAAAATTCGTGATGTTCGTGAGTTTCCGGT 

ACTAACCATCAGTAATCAGAAGAAAATGGGTGAGGTTGAATTTATTAAAG 

TT AATAAAG ACAAA.CATTCAGAATCGCTTTTGGGAG CTAA 

CAGAT AGAAAAAGATTTTT CTG GGTAT AAG CAATTTGTT C CAGAGGG AAG 

TGATGTTACAACAAAGAATGATGGTAAAATTTATTTTAAAGCACTTCAAG 

ATGGT AACTATAAATT AT ATGAAATTT CAAGTC CAGATGG CTATATAGAG 

GTTAAAACGAA^CCTGTTGTGACATTTAGAATTCAAAATGGAGAAGTTAC 

GAACCTGAAAGCAGATCCAAATGCTAATAAAAATCAAATCGGGTATCTTG 

AAGG AAATG GTAAACATCTT ATTAC CAACACT CCCAAACG CC CACCAGGT 

GTT 

SEQ ID NO. 4103: SAG0649 FROM A909 GBS TYPE Ia STRAIN

GGTGAAACCCAAGATACCAATCAAGCACTTGGAAAA 
GTAATTGTTAAAAAAACGGGGGACAATGCTACACCATTAGGCA\AGCGAC 
TTTTGTGTTAAAAAATG ACAATGATAAGTCAgAAACAAGT CACG AAA.CGG 
TAGAGGGTTCTGGAG AAgCAAC CTTTG AAAACAT AAAAC CTG GAG ACTAC 
ACATTAAGAGAAGAAACAG CACCAATTGGTTATAAAAAAACTGAT AAAAC 
CTGG AAAGTTAAAGTTG CAGATAACGGAGCAACAATAAT CGAGGGTATGG 
ATGCAGATAAAG CAG AGAAACX3 AAAAGAAGTTTTGAATG C CCAAT AT C CA 
AAAT CAGCTATTTATGAGGATACAAAAGAAAATTACC CATTAgTT AA.TGT 
AGAGGGTTCCAAAGTTGGTGAACAATACAAAGCATT'G^ 
GAAAAGATGGTCGAAGAGAGATTG CTG AAGGTTGGTTAT CAAAAAAAATT 
ACAGGGGTCAATGATCrCGATAAGAATAAATATAAAATTGA^TTAACTGT 
TGAGGGTAAAAC CACTGTTG AAACG AAAGAACTTAAT CAACCACTAG ATG 
TCGTTGTGCTATTAGATAATTCAAATAGTATGAATAATGAAAGAGCCAAT 
AATTCT CAAAGAG CATTAAAAG CTGGGGAAG CAGTTG AAAAG CTG ATTGA 
TAAAATTACATCAAATAAAG ACAATAGAGTAG CT CTTGTG ACATATG CCT 
CAACCATTTTTGATGGTACTGAAGCGACCGTATCAAAGGGAGTTGCCGAT 
CAAAATGGTAAAGCGCTGAATGATAGTGTATCATGGGATTATCATAAAAC 
TACTTTTACAGCAACTACACATAATTAC^ 

ATGCTAACGAAGTTAATATT CTAAAGT CAAGAATT C CAAAGGAAG CGGAG 
CATAT AAATGGGGATCG CACG CTCTAT CAATTTGGTG CG ACATTTACTCA 
AAAAG CTCTAATGAAAG CAT^TGAAATTTTAG AG ACACAAAGTTCTAATG 
CTAGAAAAAAACTT ATTTTT CACGTAACTGATGGTGTCC CTACGATGTCT 
TATG CCATAAATTTT AATCCTTATATATCAACAT CTTACCAAAAC CAGTT 
TAATT CTTTTTTAAAT AAAATAC CAGATAGAAGTGGTATT CT CCAAG AGG 
ATTTTATAATCAATGGTGATGATTATCAAATAGTAAAAGGAGATGGAGAG 
AGTTTTAAACTGTTTTCGGATAGAAAAGTTCCTGTTACT 
ACAAGCAGCTTATCGAGTACCGCAAAATCAACTCTCTGTA^TGAGTAATG 
AGGGATATGCAATTAATAGTG<3ATATATTTATCTCTATTGGAGAGATTAC 
AACTGGGTCTATCCATTTGATCCTAAGACAAAGAAAGTTTCT 
ACAAATCAAAACTCATGGTGAGCCAACAACATTATACTTTAATGGAAATA 
TAAGACCTAAAGGTTATGACATTTTTACTGTTGGGATTGGTGTAAACGGA 
GATC CTGGTG CAACT C CT CTTG AAG CTGAG AAATTTATG CAAT CAATAT C 
AAGTAAAACAGAAAATTATACTAATGTTGATGATACAAATAAAATTTATG 
ATGAG CTAAATAAAT ACTTTAAAACAATTGTTGAGGAAAAACATTCTATT 
GTTG ATGGAAATGTG ACTG ATCCTATG GG AGAGATGATTG AATT CCAATT 
AAAAAATGGT CAAAGTTTTACACATGATGATTACG t TTTGG tTGG AAATG 
AtGGCAGTCAATTAAAAAATGGTGTGGCTCTTGGTGGACCAAACAGTGAT 
GGGGGAATTTTAAAAGATGTTACAGTGACTTATGATAAGACATCTCAAAC 
CATCAAAAT CAATCATTTGAACITAGGAAGTGa^CAAAAAGT AGTTCTTA 
CCTATGATGTACGTTTAAAAGATAACTATATAAGTAAaAAATTTTACAAT 
ACAAATAATCGTACAACGCTAAGTCCGAAGAGTGAAAAAGAACCAAATAC 
TATTCGTGATTT'CCCAATTCCCAAAATTCGTGATGTTCGTGAGTTTCCGG 
TACTAAC CATCAGT AAT CAGAAGAAAATGGGTGAGGTTG AATTTATTAAA 
GTTAATAAAGACAAACATTCAGAATCGCrTTTGGGAGCT 
TC^GATAGAAAAAGATTTTTCTGGGTATAAGCAATTTGTTCCAGAGGGAA 
GTGATGTTACAAiCAAAGAATGATGGTA^AATTTATTTTAAAG CACTT CAA 
GATGGTAACT ATAAATTAT ATGAAATTTCAAGT C CAGAT G G CTATAT AGA 
GGTTAAAACGAAACCTGTTGTGACATTTACAATTCAAAATGGAGAAGTTA 
CGAAC CTGAAAG CAGATCCAAATGCTAATAAAAAT CAAAT CGGGT AT CTT 
GAAGG AAATGGTAAACATCTTATT ACCAACACT C C CAAACGC CCAC CAGG 
TGTT 

SEQ ID NO. 4104: SAG0649 FROM 18RS21 GBS TYPE II STRAIN 
GGTGAAACCCAAGATACCAATCAAGCAC 

TTGGAAAAGTAATTGTTAAAAAAACGGGAGACAaTGCTACACCaTTAGGC 
AAAG CGACTTTTGTGTTAAAAAATGACAATGATAAGT CAG AAACAAGT CA 
CGAAACGGTAGAGGGTTCTGGAGAAgCAACCTTTGAAAACATAAAACCTG 
GAGACTACACATTAAGAGAAGAAACAG CAC CAATTGGTTATAAAAAAACT 
GATAAAAC CTGG AAAGTTAAAGTTG CAGAT AACG GAG CAACAAT AAT CG A 
GGGTATGG ATG CAG ATAAAG CAGAG AAACG AA aAG AAGTTTTGAATG CC C 
AATAT CCAAAAT CAG CT ATTTATG AGG ATACAAAAGAAAATT AC C CATTA 
GTTAATGTAGAGGGTTCCAAAGTTGGTGAACAATACAAAGCATTGAATCC 
AATAAATGGAAAAGATGGTCGAAGAGAGATTGCTGAAGGTTGGTTATCAA 
AAAAAAT Ta C aG GGGT CAATGAT CTCG AT AAG AATAAAT ATAAAATTGAA 
TTAACTGTTGAGGGTAAAACCACTGTTGAAACGAAAGAACTTAATCAACC
ACTAGATGTCGTTGTGCTATTAGATAATTCAAATAGTATGAATAATGAAA 
GAGC CAATAATT CT CAAAGAG CATTAAAAG CTGGG GAAG CAGTTGAAAAG 
CTGATTG ATAAAATT ACAT CAAAT AAAGACAATAG AGTAG CT CTTGTGAC 
ATATG CCTCAAC CATTTTTG ATGGTACTG AAG CG ACCGTATCAAAGGGAG 
TTG C CGATCAAAATGGTAAAG CG CT GAATG AT AGTGT AT CATGGG ATTAT 
CATAAAACTACITTTACAGCAACT^ 

AAC^\AATGATG CT AACGAAGTT AATATTCT AAAGTCAAG AATTC CAAAG G 

AAG CGGAGCATATAAATGG G GAT CG CACG CT CTATCAATTTGGTG CG ACA 

TTTACTCAAAAAGCTCTAATGAAAGCAAATGAAATTTTAGAGACACAA^ 

TTCTAATGCTAGAAAAAAA.CTTATTTTTCACGTAACT 

CG ATGTCTT ATG C CATAAATTTTAAT CCTT AT AT ATCAACAT CTT AC CAA 

AAC CAGTTTAATT CTTTTTT AAATAAAATAC CAG ATAGAAGTGGT ATT CT 

CCAAG AGGATTTT ATAAT CAATGGTG ATG ATT AT CAAATAGTAAAAGGAG 

ATGGAGAGAGTTTTAAACTG1TTTCGGATAGAAAAGTTCCTGTTACTGGA 

GG AACGACACAAG CAG CTT AT CGAGTAC CG CAAAATCAACT CT CTGT AAT 

GAGT AATGAGGG AT ATG CAATTAAT AGTGGAT ATATTTAT CTCTATTGGA 

GAGATTACAACTGGGT CTAT C CATTTG AT CCTAAGAGAAAGAAAGTTT CT 

GCAACGAAACAAATCAAAACTCATGGTGAGCCAACAACATTATACTT^ 

TGGAAATATAAGACCTAAAGGTTATGACATTTTTACTGTTGGGATTGGTG 

TAAACGGAGATCCTGGTGCA^CTCCTCTTGAAGCTGAGAAATTTATGCAA 

TCAATATCAAGTAAAACAGAAAATTATACTAATGTTGATGATACAAATAA 

AATTTATGATGAGCTAAATAAATACITTAAAACAATTGTTGAGGAAAAAC 

ATTCTATTGTTGATG GAAATGTGACTG AT C CTATGGG AGAGATG ATTGAA 

TTCCAATTAAAAAATGGTCAAAGTTTTACAC^TGATGAT^ 

TGGAA ATGATGGCAGT CAATTA A AAAATGGTGTGG CT CTTGGTGG AC CAA 

ACAGTGATGGGGGAATTTTAAAAGATGTTACAGTGACTTATGATAAGACA 

TCTCAAACCATCAAAATCAATCATTTGAACTTAGGAAGTGGACAAAAAGT 

AGTTCTTACCrATGATGTACGTTTAAAAGATAACTATATAAGTAACAAAT 

TTTACAATACAAAT AAT CGT ACAACG CTAAGTC CG AAGAGTG AAAAAGAA 

CCAAATACTAT t cG t gATT t C C CAATT CC CAAAATT CGTG ATGTTCGTGA 

GTTT C CGGTACT AAC CATCAGT AATCAGAAGAAAATGGGTGAGGTTG AAT 

TTATTAAAGTTAATAAAGACAAACATTCAGAATCGCTTTTGGa^GCTAAG 

TTTCAACTTCAGATAGAAAAAGATTTTTCTGGGTATAAGCAATTTGTTCC 

AGAGGGAAGTGATGTTACAACAAAGAATGATGGTAAAATTTATTTTAAAG 

CACTTCAAGATGGT A^ CTAT AAATTATATGAAATTTCAAGTC CAG ATGG C 

TATATAGAGGTTAAAACGAAAC CTGTTGTG ACATTTACAATT CAAAATGG 

AG AAGTTACGAACCTGAAAG CAGAT C CAAATG CT AATAAAAATCAAATCG 

GGTATCTTGAAGGAAATGGTAAACATCTTATTACCAACACTCCC^\ACGC 

CCACCAGGTGTT 

SEQ ID NO. 4105: SAG0649 FROM M732 GBS TYPE III STRAIN 

GGTG AAACCCAAGAT AC CAATCAAG CACT 

TGGAAAAGTAATTGTTAAAAAAACGGGAGACAaTGCTACACCATTAGGCA 
AAGCGACTTTTGTGTTAAAAAATGACAATGATAAGTC^GAAA 
GAAACGGTAGAGGGTTCTGGAGAAGCAACCTTTGAAAACATAAAACCTGG 
AGACT ACACATT AAG AG AAG AAACAG CAC CAATTGGTTATAAAAAA^CTG 
AT AAAACCTGGAAAGTTAAAGTTG CAG ATAACGG AG CAACAATAAT CGAG 
GGTATGGATG CAGAT AAAG CAG AGAAACGAAAAG AAGTTTTGAATG C C CA 
ATATCCAAAATCAGCT'ATTTATGAGGATACAAAAGAAAATTACCCATTAg 
TT AAT GTAGAGGGTTCCAAAGTTG GTGAACAATACAAAG CATTG AAT CCA 
ATAAATGGAAAAGATGGTCGAAGAGAGATTG CTG AAGGT TGGTT ATCAAA 
AAAAAaTa CaGGGGTCAATGATCT CGATAAGAATAAATATAAAATTGAAT 
TAACTGTTGAGGGTAAAACCACTGTTGAAACGAAAGAACTTAATCAACCA 
CT AG ATGTCGTTGTG CTATTAGATAATT CAAATAGTATGAAT AATGAAAG 
AG C CAATAATT CTCAAAGAG CATT AAAaG CTGGGGAAGCAGTTG AAAAG C 
TGATTGATAAAATTACATCAAATAAAGACAATAGAGTAGCTCTTGTGACA 
TATG C CT CAACCATTTTTGATGGTACTGAAGCGAC CGT AT CAAAGGG AGT 
TGCCGATCAAAATGGTAAAGCGCTGAATGATAGTGTATCATGGGATTATC 
ATAAAACTACTTTTACAGCAACTACACA 

ACAAATG ATG CTAACGAAGTTAAT ATT CT AAAGT CAAGAATT CCAAAGG A 
AGCC<3AGCATATAAATGGGGATCGCACGCTCTATCAATTTGGTGCGACAT 
TT ACT CAAAAAG CTCTAATG AAAG CAAATGAAATTTTAGAGACACAAAGT 
TCTAATGCTAGAAAAAAACTTATTTTTCACGTAACTGATGGTGTCCCTAC 
GATGT CTTATG C CATAAATTTTAAT C CTTATATAT CAACATCTT ACCAAA 
AC CAGTTTAATT CTTTTTT AAATAAAATAC CAGAT AG AAGTGGT ATT CT C 
CAAGAGGATTTTATAATCAATGGTGATGATTATCAAATAGTAAAAGGAGA 
TGGAGAGAGTTTTAAACTGTTTTCGGATAGAAAAGTTCCTGTTACTGGAG 
GAACGACACAAGCAG CTTATCGAGTACCGCAAAAT CAACTCTCTGTAATG 
AGTAATGAGGGATATGCAATTAATAGTGGATATATTTATCTCTATTGGAG 
AGATTACAACTGGGTCTATCCATTTGATCCTAAGACAAAGAAAGTTTCTG 
CAACGAAACAAAT CAAAACT CATGGTG AG C CAACAA CATT ATACTTT AAT 
GG AAATATAAGACCTAAAGGTTATGACATTTTTACTGTTGGGATTGGTGT 
AAACGGAGATCCTGGTGCAACTCCTCTTGAAGOTGAGAAATTTATGCAAT 
CAAT ATCAAGTAAAACAGAAAATT ATACTAATGT TG ATG AT ACAAATAAA 
ATTTATGATGAGCTAAATAAATACTTTAAAACAATTGTTGAGGAAAAACA 
TT CTATTGTTGATG G AAATGTG AC TG ATC CTATG GGAGAG ATGATTG AAT 
TCCAATTAAAAAATGGTCAAAGTTTTACACATGATGATTACG tTTTGG t T 
GGAAATGAtGGCAGTCAATTAAAAAATGGTGTGGCTCTTGGTGGACCAAA 
CAGTGATGGGGGAATTTTAAAAGATGTTACAGTGACTTATGATAAGACAT 
CTCAAACCATCAAAATCAATCATTTGAACTTAGGAAGTGGACAAAAAGTA 
GTTCTTACCTATGATGTACGTTTAAAAGATAACTATATAAGTAACAAATT 
TT ACAATACAAAT AAT CGT ACAa CG CT AAGT CCG AAG AGTGAAAAAG AAC 
CAAATACTATTCGTGATTTCCCAATTCCCAAAATTCGTGATGTTCGTGAG
TTTCCGGTACTAACCATCAGTAATCAGAAGA?UyVTGGGTGAGGTTGAATT 
TATTAAAGTTAATAAAG ACAAACATTCAGAAT CG CTTTTGGGAG CTAAGT 
TTCAACTTCAGATAGAAAAAGATTTTTCTGGGTATAAGCAATTTGT^ 
GAGGGAAGTGATGTTACAACAAAGAATGATGGTAAAATTTATTTTAAAGC 
ACTT CAAGATGGTAA.CTATAAATTATATG AAATTTCAAGTCCAGATGG CT 
ATATAGAGGTTAAAACGAAACCTGTTGTGACATTTACAATTCAAAATGGA 
GAAGTTACG AAC CTGAAAG CAG AT CCAAATG CTAATAAAAAT CAAATCGG 
GTATCTTGAAGG AAATGGT AAACAT CTTATTACCAACACTCC CAAACG C C 
CACCAGGTGTT 

SEQ ID NO. 4106: SAG0649 FROM COH1 GBS TYPE III STRAIN

GGTGAAACCCAAGATACCAATCAAGCACTTGGAAAAG 

TAATTGTTAAAAAAACGGGAGACAaTGCTACACCATTAGGCAAAGCGACT 

TTTGTGTTAAA*U\ATGACAATGAT AAGTGAGAAACAAGT CACGAAACGGT 

AGAGGGTTCTGGArAAGCAACCrTTGAAAACATAAAACCTGGAGACTACA 

CATTAAGAGAAGAAACAGCACCAATTGGTTATAAAAAAACTGATAAAACC 

TGGAAAGTTAAAGTTGCAGATAACGGAGCAACAATAATCGAGGGTATGGA 

TG CAGATAAAGCAGAGAAACGAAAAGAAGTTTTG AATG CCCAATATC CAA 

AATCAGCTATTTATGAGGATACAAAAGAAAATTACCCATTAgTTAATGTA 

GAGGGTTCCAAAGTTGGTGAACAATaCAAAGCATTGAATCCAATAAATGG 

AAAAGATGGTCGAAGAGAGATTGCTGAAGGTTGGTTATCAAAAAAAAATA 

CAGGGGTCAATGATCTCgATAAGAATAAATATAAAATTGAATTAACTGTT 

GAGGGTAAAACCACTGTTGAAACX5AAAGAACTTAATCAACCACTAGATGT 

CGTTGTG CTATT AG ATAATT CAAATAGTATGAATAATGAAAG AG C CAATA 

ATTCT CAAAG AG CATTAAAAG CTGGGGAAG CAGTTGAAAAGCTGATTGAT 

AAAATTACATCAAATAAAGACAATAGAGTAGCrCTTGTGACATATGCCTC 

AACCATTTTTGATGGTACTGAAG CGACCGTATCAAAGGGAGTTG C CG AT C 

AAAATGGTAAAG CG CTG AATGATAGTGTAT CATGG G ATTATCAT AAAACT 

ACTTTTACAGCAACTACACATAATTACAGTTATTT 

TG CTAACGAAGTTAATATTCTAAAGTCAAGAATT C CAAAGGAAG CGGAGC 
AT AT AAATG GGG AT CGCACG CT CTATCAATTTGGTG CGACATTTACTCAA 
AAAG CTCTAATG AAAG CAAATG AAATTTTAGAGACACAAAGTTCTAATG C 
TAGAAAAAAACTTATlU"iU'CACGTAACTGATGGTGTCCCTACGATGTCTT 
ATGCCATAAATTTT AATCCTTATAT AT CAACATCIT'ACCAAAAC CAGTTT 
AATT CTTTTTTAAATAAAATACCAGATAGAAGTGGTATT CTCCAAGAGG A 
TTTTATAATCAATGGTGATGATTATCAAATAGTAAAAGGAGATGGAGAGA 
Gl"riTAAACTGTlTTCGGATAGAAAAGTTCCTGTTACTGGAGGAACGACA 
CAAG CAG CTTATCGAGTACCG CAAAATCAACTCT CTGTAATG AGTAATGA 
GGGATATGCAATTAATAGTGGATATATTTATCTCTATTGGAGAGATTACA 
ACTGGGTCTATCCATTTGATCCTAAGACAAAGAAAGTTTCTGCAACX3AAA 
CAAAT CAAAACT CATGGTGAG CCAACAACATTATACTTT AATGGAAATAT 
AAGACCTAAAGGTTATGACATTTTTACTGTTGGGATTGGTGTAAACGGAG 
AT CCTGGTGCA^CT C CTCTTG AAG CTGAGAAATTTATG CAAT CAATATCA 
AGTAAAACAGAAAATTATACTAATGTTGATGATACAAATAAAATTTATGA 
TGAGCTAAA.TAAATACITTAAAACAATTGTTGAGGAAAAACATTCTATTG 
TTGATGGAAATGTGACTGATCCTATGGGAGAGATGATTGAATTCCAATTA 
AAAAATGGTCAAAGTTTTACACATGATGATTACGTTTTGGTTGGAAATGA 
TGGCAGTCAATTAAAAAATGGTGTGGCTCI^GGTGGACCAAACAGTGATG 
GGGG AATTTTAAAAG ATGTTACAGTGACTT ATGAT AAGACAT CT CAAAC C 
ATCAAAATCAAT CATTTGAACTTAGGAAGTGGACAAAAAGTAGTTCTTAC 
CTATGATGTACGTTTAAAAGATAACTATATAAGTAACAAATTTTACAATA 
CAAATAATCGTACAACGCTAAGTCCGAAGAGTGAAAAAGAACCAAATACT 
ATTCGTGATTTCCCAATTCCCAAAATTCGTGATGTTCGTGAGTTTCCGGT 
ACTAACCATCAGTAATCAGAAGAAAATGGGTGAGGTTGAATTTATTAAAG 
TTAATAAAG ACAAACATTCAgAATCG CTTTTG GGAG CTAAGTTTCAACTT 
CAGATAGAAAAAGATTTTT CTGGGTAT AAG CAATTTGTT C CAGAGGGAAG 
TGATGTTACAACAAAGAATGATGGTAAAATTTATTTTAAAGCACTTCAAG
ATGGTAACTATAAATTATATGAAATTTCAAGTCCAgATGGCTATATAGAG 
GTTAAAACGAAACCTGTTGTGACATTTACAATTCAAAATGGAGAAGTTAC 
GAACCTG AAAG CAG ATCCAAATGCTAATAAAAAT CAAATCGG GT AT CTTG 
AAGGAAATGGTAAACATCTTATTACCAACACTCCCAAACGCCCACCAGGT 
GTT 

SEQ ID NO. 4107: SAG0649 FROM M781 GBS TYPE III STRAIN

TTGGAAAAGTAATTGTTAAAAAAACGGGAGACACTGCTACACCATTAGGC 
AAAG CGACTTTTGTGTT AAAAAAT G ACAATGATAAGT CAGAAACAAGT CA 
CGAAACGGTAGAGGGTTCTGGAAAAGCAACCTTTGAZVAACATAAA^CCTG 
GAGACTACACATTAAGAGAAGAAACAGCACCAATTGGTTATAAAAAAACT 
GATAAAAC CTGGAAAGTTAAAGTTG CAGATAACGG AG CAmCAATAATCG A 
GGGTATGGATGCAGATAAAGCAGAGAAACGAAAAGAAGTTTTGAATGCCC 
AATAT C CAAAAT CAG CTATT T ATGAGGATACAAAAGAAAATTACC CATT A 
gTTAATGTAGAGGGTTCCAAAGTTGGTGAACAATACAAAGCATTGAATCC 
AATAAATGGAAAAG ATGGT CgAAG AGAGATTG CTG AAGGTTG GTTAT CAA 
AAAAAATTACaGGGGTCAATGATCTCGATAAGAATAAATATAAAATTGAA 
TTAACTGTTGAGGGTAAAACCACTGTTGAAACgAAAGAACTTAATCAACC 
ACTAGATGTCGTTGTGCTATTAGATAATTCAAATAGTATGAATAATGAAA 
GAGC CAAT AATTCT CAAAGAG CATT AAAAG CTGGG G AAG CAGTTG AAAAG 
CT GATTGAT AAAATTACATC AAATAAAGACAATAG AGTAG CT CTTGTG AC 
ATATGCCTCAACCATITTTGATGGTACrGAAGCGACCGTATCAAAGGGAG 
TTGCCGATCAAAATGGTAAAGCGCTGAATGATAGTGTATCATGGGATTAT 
CATAAAACT ACTTTT ACAG CAACTACACATAATTACAGTTATTT AAATT T 
AACAAATGATGCTAACGAAGTTAATATTCTAAAGTCAAGAATTCCAAAGG 
AAGCGGAGCATATAAATGGGGATCGCACGCTCTATCAATTTGGTGCGACA

TTTACTCAAAAAG CTCTAATGAAAG CAAATGAAATTTTAGAGACACAAAG 

TT CTAATGCTAG AAAAAAAGITATTTTTCACGTAACTGAT GGTGTCC CTA 

CGATGTCTTATGCCATAAATTTTAATCCTTATATATCAAGATCrTACCAA 

AACCAGTTTAATTCTTTTTTAAATAAAATACCAGATAGAAGTGGTATTCT 

CCAAGAGGATTTTATAATCAATGGTGATGATTATCAAATAGTAAAAGGAG 

ATGGAGAGAGTTTTAAACTGTTTTCGGATAGAAAAGTTCCTGTTACTGGA 

GGAACGACACAAGCAGCTTATCGAGTACCGCAAAATCAACTCTCTGTAAT 

GAGTAATGAGGGATATGGAATTAATAGTGGATATATTTATCTCTAtTGGA 

GAGATTACAACTGGGTCTATCCATTTGATCCTAAGACAAAGAAAGTTTCT 

G CAACGAAACAAAT CAAAACT CATGGTGAGCCAACAACATTATACTTTAA 

TGGAAATATAAGACCTAAAGGTTATGACATTTTTACTGTTGGGATTGGTG 

TAAACGG AGATCCTG GTGCAACTC CTCTTG AAG CTGAGAAATTTATG CAA 

TCAATATCAAGTAAAACAGAAAATTATACTAATGTTGATGATACAAATAA 

AATTTATGATGAGCTAAATAAATACTTTAAAACAATTGTTGAGGAAAAAC 

ATTCT ATTGTTG ATGGAAATGTGACTG AT C CT ATGGGAG AGATG ATTG AA 

TTCCAATTAAAAAATGGTCAAAGTTTTACACATGATGATTACGTTTTGGT 

TGGAAATGATGGCAGTCAATTAAAAAATGGTGTGGCTCTTGGTGGACCAA 

ACAGTGATGGGGGAATTTTAAAAGATGTTACAGTGACTTATGATAAGACA 

TCTTCAAACCATCAAAATCAATCATTTGAACTTAGGAAGTGGACAAAAAGT 

AGTTCTTACCTATGATGTACGTTTAAAAGATAACTATATAAGTAACAAAT

TTT ACAATACAAATAAT CG TACAACG CTAAGT CCGAAGAGTGAAAAAGAA 

CCAAATACTATTCGTGATTTCCCAATTCCCAAAATTCGTGATGTTCGTGA 

GTTTCCGGTACTAAC CAT CAGTAAT CAGAAGAAAATGGGT GAGGTTG AAT 

TTATTAAAGTTAATAAAGACAAACATT CAGAATCG CTTTTGGGAG CT AAG 

TTTCAACTTCAGATAGAAAAAGATTTTTCrGGGTATAA 

AGAGG GAAGTGATGTTACAACAAAG AATG ATGGTAAAATT TATTTTAAAG 

CACTTCAAG ATGGTAACTATAAATT ATATGAAATTT CAAGTCCAG ATGG C 

TATATAGAGGTTAAAACGAAACCTGTTGTGACATTTACAATTCAAAATGG 

AG AAGTT ACGAACCTGAAAG CAGAT C CAAATG CTAAT AAAAAT CAAATCG 

GGTATCTTGAAGGAAATGGTAAACATCTTATTACC^CACTCCCAAACGC 

CCACCAGGTGTT 

SEQ ID NO. 4108: SAG0649 FROM CJB GBS NONTYPEABLE STRAIN

GGTGAAACCCAAGAT ACCAAT CAAG CACTTGGAAAAGT 
AATTGTTAAAAAAACGGGAGACAaTG CTACACCATTAGG CAAAG CGACTT 
TTGTGTTAAAAAATGACAATGATAAGTCAGAAACAAGTCACGAAACGGTA 
GAGGGTT CTGGAr AAGCAAC CTTTG AAAACATAAAAC CTGGAGACTACAC 
ATTAAGAGAAGAAACAGCACCAATTGGTTATAAAAAAACTGATAAAACCT
GGAAAGTTAAAGTTG CAGAT AACGG AG CAACAAT AATCG AGGGTATGGAT 
GCAG ATAAAG CAGAG AAACG AAAAGAAGTTTTGAATG CC CAATATCCAAA 
AT CAG CTATTTATG AGG AT ACAAAAGAAAATTAC CCATT AgTTAATGTAG 
AGGGTT C CAAAG TTGGTGAACAAT A CAAAG CATTGAATCCAATAAATGGA 
AAAG ATGGT CGAAG AGAGATTG CTG AAGGTTGGTTAT CAAAAAAAATTAC 
aGGGGTCAATGATCTCGATAAGAATAAATATAAAATTGAATTAACTGTTG 
AGGGTAAAACCACTGTTGAAACGAAAGAACTTAATCAACCACTAGATGTC 
GTTGTGCTATTAgATAATTCAAATAGTATGAATAATG AAAGAG C CAATAA 
TTCTCAAAGAGCATT AAAAG CTGG GGAAG CAGTTGAAAAG CTGAT TGATA 
AAATTACAT CAAAT AAAGACAATAGAGTAGCT CTTGTGACATATG C CT CA 
ACCATTTTTGATGGTACTGAAGCGACCGTATCAAAGGGAGTTGCCGATCA
AAATGGTAAAGCGCTGAATGATAGTGTATCATGGGATTATCATAAAACTA 
CTTTT ACAG CAACT ACACATAATT ACAGTTATTTAAATTTAACAAATG AT 
GCTAACGAAGTTAATATTCTAAAGTCAAGAATTCCAAAGGAAGCGGAGCA 
TATAAATGGGGATCG CACG CTCTAT CAATTTGGTG CGACATTTACT CAAA 
AAG CTCTAATGAAAG CAAAT GAAATTTTAGAGACACAAAG TT CTAAT GCT 
AGAAAAAAACTTATTTTTCACGTAACTGATGGTGTCCCTACGATGTCTTA 
TGCCATAAATTTTAATCCTTATATATCAACATCTTACCAAAACCAGTTTA
ATTCTTTTTTAAATAAAATACCAGATAGAAGTGGTATTCTCCAAGAGGAT 
TTTAT AATCAATGGTGATG ATTAT CAAATAGTAAAAG GAG ATGGAGAGAG 
TTTTAAACTGTTTTCGGATAGAAAAGTTCCTGTTACTGGAGGAACGACAC 
AAGCAGCTTATCGAGTACCGCAAAATCAACTCTCTGTAATGAGTAATGAG 
GG AT ATG CAATTAAT AGTG G AT ATATTTATCT CTATTGGAGAGATTACAA 
CTGGGTCTATCCATTTGATCCTAAGACAAAGAAAGTTTCTGCAACGAAAC 
AAATCAAAAC TCATGGTGAG CCAACAACATTATACTTTAATGGAAAT ATA 
AGACCTAAAG GTTATGACATTTTTACTGTTGGGATTGGTGTAAAC GG AGA 
TCCTGGTGCAACTCCTCTTGAAGCTGAGAAATTTATGCAATCAATATCAA 
GTAAAACAGAAAATTATACTAATGTTGATGATACAAATAAAATTTATGAT 
GAG CTAAATAAATACTTT AAAACAATTGTTGAGG AAAAACATTCTAT TGT 
TGATGGAAATGTGACTGATCCTATGGGAGAGATGATTGAATTCCAATTAA 
AAAATGGTCAAAGTTTTACACATGATGATTACGTTTTGGTTGGAAATGAt
GGCAGTCAATTAAAAAATGGTGTGGCTCTTGGTGGACCAAACAGTGATGG 
GGGAATTTTAAAAG ATGTTACAGT GACTTATG AT AAG ACATCT CAAAC CA 
TCAAAAT CAATCATTTGAACTTAG GAAGTGG ACAAAAAGT AGTT CTTACC 
TATGATGTACGTTTAAAAGATAACTATATAAGTAACAAATTTTACAATAC
AAATAATCGTACAACGCTAAGTCCGAAGAGTGAAAAAGAACCAAATACTA 
TTCGTGATTTCCCAAT t CCCAAAATTCGTGATGTTCGTGAGTTTC CGGTA 
CTAACCATCAGTAATCAGAAGAAAATGGGTGAGGTTGAATTTATTAAAGT
TAATAAAGACAAACATTCAGAATCGCTTTTGGGAGCTAAGTTTCAACTTC
AGATAGAAAAAGATTTTTCTGGGTATAAGCAATTTGTTCCAGAGGGAAGT 
GATGTTACAACAAAGAATGATGGTAAAATTTATTTTAAAGCACTTCAAGA 
TGGTAACTATAAATTATATGAAATTTCAAGTCCAGATGGCTATATAGAGG 
TT AAAACGAAAC CTGTTGTG ACAT TTACAATT CAaAATGG AG AAGTTACG 
AACCTGAAAG CAGAT C CAAATG CTAATAAAAAT CAAATCG GGTAT CT TGA 
AGGAAATGGTAAACATCTTATTACCAACACTCCCAAACGCCCACCAGGTG
TT 

SEQ ID NO. 4109: SAG0649 FROM JM9130013 GBS TYPE VIII STRAIN 

GGTGAAACCCAAGATACCAATCAAGCACTTGGAAAAG 

TAATTGTTAAAAAAACGGGAGACAATGCTACACCATTAGGCAAAGCGACT 

TTTGTGTTAAAAAATGACAATGATAAGTCAGAAACAAGTCACGAAACGGT 

AGAGG GTTCTGG AGAAG GAACCTTT GAAAACATAAAACCTGGAGACTACA 

CATT AAG AG AAGAAACAGCAC CAATTG GTTAT AAAAAAACTGAT AAAAC C 

TGGAAAGTTAAAGTTGCAGATAACGGAGCAACAATAATCGAGGGTATGGA 

TG CAG AT AAAG CAG AGAAACGAAAAGAAGTTTTG AATGC CCAATAT C CAA 

AATCAGCTATTTATGAGGATACAAAAGAAAATTACCCATTAGTTAATGTA 

GAGGGTTCCAAAGTTGGTGAACAATACAAAGCATTGAATCCAATAAATGG 

AAAAG ATGGT CG AAG AG AG ATTG CTGAAG GTTGGTTATCAAAAAAAATTA 

CAGGGGT CAATG AT CTCGATAAGAATAAATATAAAATTG ARTTAACTGTT 

GAGGGTAAAACCACTGTTGAAACGAAAGAACTTAATCAACCACTAGATGT 

CGTTGTGCTATTAGATAATTCAAATAGTATGAATAATGAAAGAGCCAATA 

ATTCTCAAAG AG CATTAAAAG CTGGGG AAG CAGTTGAAAAG CTGATTGAT 

AAAATTACAT CAAATAAAG ACAATAGAGTAG CT CTTGTG ACATATG C CT C 

AACCATTTTTGATGGTACTGAAGCGACCGTATCAAAGGGAGTTGCCGATC 

AAAATGGTAAAG CG CTGAATGATAGTGTATCATGGGATTATCATAAAACT 

ACTTTTACAGCAACTACACATAATTACAGTTATTTAAATTTAACAAATGA 

TGCTAACGAAGTTAATATTCTAAAGTCAAGAATTCCAAAGGAAGCGGAGC

ATATAAATGGGGATCGCACGCTCTATCAATTTGGTGCGACATTTACTCAA 

AAAGCTCTAATGAAAGCAAATGAAATTTTAGAGACACAAAGTTCTAATGC 

TAGAAAAAAACTTATTTTTCACGTAACTGATGGTGTCCCTACGATGTCTT 

ATGC CATAAATTTTAATCCTTATAT AT CAACATCTTACCAAAACCAGTTT 

AATT CTTTTTTAAAT AAAATAC CAG ATAG AAGTGGTATT CTC CAAGAGGA 

TTTTATAATCAATGGTGATGATTATCAAATAGTAAAAGGAGATGGAGAGA

GTTTTAAACTGTTTT CGGATAGAAAAGTT C CTGTT ACTGGAGGAACGACA 

CAAG CAG CTT AT CG AGT AC CG CAAAATCAACT CTCTGTAATGAGTAATGA 

GGGATATGCAATTAATAGTGGATATATTTATCTCTATTGGAGAGATTACA 

ACTGGGTCTATCCATTTGATCCTAAGACAAAGAAAGTTTCTGCAACGAAA

CAAAT CAAAACT CATGGTG AGCCAACAACATTATACTTTAATGG AAATAT 

AAGACCTAAAGGTTATGACATTTTTACTGTTGGGATTGGTGTAAACGGAG

ATCCTGGTGCAACTCCTCTTGAAGCTGAGAAATTTATGCAATCAATATCA 

AGTAAAACAGAAAATTATACTAATGTTGATGATACAAATAAAATTTATGA 

TGAGCTAAATAAATACTTTAAAACAATTGTTGAGGAAAAACATTCTATTG

TTGATGGAAATGTGACTGATCCTATGGGAGAGATGATTGAATTCCAATTA 

AAAAATGGTCAAAGTTTTACACATGATGATTACGTTTTGGTTGGAAATGA

TGGCAGTCAATTAAAAA^TGGTGTGGCTCT^GGTGGACCAAAa^GTGATG 

GGGGAATTTTAAAAGATGTTACAGTGACTTATGATAAGACATCTCAAACC 

ATCAAAAT CAAT CATTTGAACTT AGGAAGTGGACAAAAAGTAGTT CTTAC 

CTATGATGTACGTTTAAAAGATAACTATATAAGTAACAAATTTTACAATA 

CAAAT AATCGTACAACG CT AAGT C CGAAG AGTGAAAAAG AACCAAATACT 

ATTCGTGATTTCCCAATTCCCAAAATTCGTGATGTTCGTGAGTTTCCGGT 

ACTAACCATCAGTAATCAAAAGAAAATGGGTGAGGTTGAATTTATTAAAG 

TTAATAAAGACAAACATTCAGAATCGCTTTTGGGAGCTAAGTTTCAACTT

CAGATAAAAAAAGATTTTTCTGGGTATAAGCAATTTGTTCCAGAGGGAAG 

TGATGTTACAACAAAGAATGATGGTAAAATTTATTTT^ 

ATGGTAACTATAAATTATATGAAATTTCAAGTCCAGATGGCTATATAGAG 

GTTAAAACGAAACCTGTTGTGACATTTACAATTCAAAATGGAGAAGTTAC 

GAAC CTGAAAGCAG ATCCAAATG CTAATAAAAAT CAAAT CGG GTATCTTG 

AA 
PRETTY of: /biotmp/msa178297.2{*} May 12, 2003 09:22 ..

1 50
msa178297.2{104_090}
msa178297.2{104_18RS21}
msa178297.2{104_2603} atgaaaaaga gacaaaaaat atggagaggg ttatcagtta ctttactaat
msa178297.2{104_CJB110}
msa178297.2{104_COH1}
msa178297.2{104_M732}
msa178297.2{104_A909}
msa178297.2{104_M781}
msa178297.2{104_JM9130013}
Consensus ********** ********** ********** ********** **********

51 100
msa178297.2{104_090} ggtgaa acccaagata
msa178297.2{104_18RS21} ggtgaa acccaagata
msa178297.2{104_2603} cctgtcccaa attccatttg gtatattggt acaaggtgaa acccaagata
msa178297.2{104_CJB110} ggtgaa acccaagata
msa178297.2{104_COH1} ggtgaa acccaagata
msa178297.2{104_M732} ggtgaa acccaagata
msa178297.2{104_A909} ggtgaa acccaagata
msa178297.2{104_M781}
msa178297.2{104_JM9130013} ggtgaa acccaagata
Consensus ********** ********** ********** **** __,



101 150
msa178297.2{104_090} ccaatcaagc acTTGGAAAA GTAATTGTTA AAAAAACGGG aGACAaTGCT
msa178297.2{104_18RS21} ccaatcaagc acTTGGAAAA GTAATTGTTA AAAAAACGGG aGACAaTGCT
msa178297.2{104_2603} ccaatcaagc acTTGGAAAA GTAATTGTTA AAAAAACGGG aGACAaTGCT
msa178297.2{104_CJB110} ccaatcaagc acTTGGAAAA GTAATTGTTA AAAAAACGGG aGACAaTGCT
msa178297.2{104_COH1} ccaatcaagc acTTGGAAAA GTAATTGTTA AAAAAACGGG aGACAaTGCT
msa178297.2{104_M732} ccaatcaagc acTTGGAAAA GTAATTGTTA AAAAAACGGG aGACAaTGCT
msa178297.2{104_A909} ccaatcaagc acTTGGAAAA GTAATTGTTA AAAAAACGGG gGACAaTGCT
msa178297.2{104_M781} TTGGAAAA GTAATTGTTA AAAAAACGGG aGACAcTGCT
msa178297.2{104_JM9130013} ccaatcaagc acTTGGAAAA GTAATTGTTA AAAAAACGGG aGACAaTGCT
Consensus __________ __******** ********** ********** _****_****



151 200
msa178297.2{104_090} ACACCATTAG GCAAAGCGAC TTTTGTGTTA AAAAATGACA ATGATAAGTC
msa178297.2{104_18RS21} ACACCATTAG GCAAAGCGAC TTTTGTGTTA AAAAATGACA ATGATAAGTC
msa178297.2{104_2603} ACACCATTAG GCAAAGCGAC TTTTGTGTTA AAAAATGACA ATGATAAGTC
msa178297.2{104_CJB110} ACACCATTAG GCAAAGCGAC TTTTGTGTTA AAAAATGACA ATGATAAGTC
msa178297.2{104_COH1} ACACCATTAG GCAAAGCGAC TTTTGTGTTA AAAAATGACA ATGATAAGTC
msa178297.2{104_M732} ACACCATTAG GCAAAGCGAC TTTTGTGTTA AAAAATGACA ATGATAAGTC
msa178297.2{104_A909} ACACCATTAG GCAAAGCGAC TTTTGTGTTA AAAAATGACA ATGATAAGTC
msa178297.2{104_M781} ACACCATTAG GCAAAGCGAC TTTTGTGTTA AAAAATGACA ATGATAAGTC
msa178297.2{104_JM9130013} ACACCATTAG GCAAAGCGAC TTTTGTGTTA AAAAATGACA ATGATAAGTC
Consensus ********** ********** ********** ********** **********



201 250
msa178297.2{104_090} AGAAACAAGT CACGAAACGG TAGAGGGTTC TGGAgAAGCA ACCTTTGAAA
msa178297.2{104_18RS21} AGAAACAAGT CACGAAACGG TAGAGGGTTC TGGAgAAGCA ACCTTTGAAA
msa178297.2{104_2603} AGAAACAAGT CACGAAACGG TAGAGGGTTC TGGAgAAGCA ACCTTTGAAA
msa178297.2{104_CJB110} AGAAACAAGT CACGAAACGG TAGAGGGTTC TGGArAAGCA ACCTTTGAAA
msa178297.2{104_COH1} AGAAACAAGT CACGAAACGG TAGAGGGTTC TGGArAAGCA ACCTTTGAAA
msa178297.2{104_M732} AGAAACAAGT CACGAAACGG TAGAGGGTTC TGGAgAAGCA ACCTTTGAAA
msa178297.2{104_A909} AGAAACAAGT CACGAAACGG TAGAGGGTTC TGGAgAAGCA ACCTTTGAAA
msa178297.2{104_M781} AGAAACAAGT CACGAAACGG TAGAGGGTTC TGGAaAAGCA ACCTTTGAAA
msa178297.2{104_JM9130013} AGAAACAAGT CACGAAACGG TAGAGGGTTC TGGAgAAGCA ACCTTTGAAA
Consensus ********** ********** ********** ****_***** **********



251 300
msa178297.2{104_090} ACATAAAACC TGGAGACTAC ACATTAAGAG AAGAAACAGC ACCAATTGGT
msa178297.2{104_18RS21} ACATAAAACC TGGAGACTAC ACATTAAGAG AAGAAACAGC ACCAATTGGT
msa178297.2{104_2603} ACATAAAACC TGGAGACTAC ACATTAAGAG AAGAAACAGC ACCAATTGGT
msa178297.2{104_CJB110} ACATAAAACC TGGAGACTAC ACATTAAGAG AAGAAACAGC ACCAATTGGT
msa178297.2{104_COH1} ACATAAAACC TGGAGACTAC ACATTAAGAG AAGAAACAGC ACCAATTGGT
msa178297.2{104_M732} ACATAAAACC TGGAGACTAC ACATTAAGAG AAGAAACAGC ACCAATTGGT
msa178297.2{104_A909} ACATAAAACC TGGAGACTAC ACATTAAGAG AAGAAACAGC ACCAATTGGT
msa178297.2{104_M781} ACATAAAACC TGGAGACTAC ACATTAAGAG AAGAAACAGC ACCAATTGGT
msa178297.2{104_JM9130013} ACATAAAACC TGGAGACTAC ACATTAAGAG AAGAAACAGC ACCAATTGGT
Consensus ********** ********** ********** ********** **********



301 350
msa178297.2{104_090} TATAAAAAAA CTGATAAAAC CTGGAAAGTT AAAGTTGCAG ATAACGGAGC
msa178297.2{104_18RS21} TATAAAAAAA CTGATAAAAC CTGGAAAGTT AAAGTTGCAG ATAACGGAGC
msa178297.2{104_2603} TATAAAAAAA CTGATAAAAC CTGGAAAGTT AAAGTTGCAG ATAACGGAGC
msa178297.2{104_CJB110} TATAAAAAAA CTGATAAAAC CTGGAAAGTT AAAGTTGCAG ATAACGGAGC
msa178297.2{104_COH1} TATAAAAAAA CTGATAAAAC CTGGAAAGTT AAAGTTGCAG ATAACGGAGC
msa178297.2{104_M732} TATAAAAAAA CTGATAAAAC CTGGAAAGTT AAAGTTGCAG ATAACGGAGC
msa178297.2{104_A909} TATAAAAAAA CTGATAAAAC CTGGAAAGTT AAAGTTGCAG ATAACGGAGC
msa178297.2{104_M781} TATAAAAAAA CTGATAAAAC CTGGAAAGTT AAAGTTGCAG ATAACGGAGC
msa178297.2{104_JM9130013} TATAAAAAAA CTGATAAAAC CTGGAAAGTT AAAGTTGCAG ATAACGGAGC
Consensus ********** ********** ********** ********** **********



351 400
msa178297.2{104_090} AaCAATAATC GAGGGTATGG ATGCAGATAA AGCAGAGAAA CGAAAAGAAG
msa178297.2{104_18RS21} AaCAATAATC GAGGGTATGG ATGCAGATAA AGCAGAGAAA CGAAAAGAAG
msa178297.2{104_2603} AaCAATAATC GAGGGTATGG ATGCAGATAA AGCAGAGAAA CGAAAAGAAG
msa178297.2{104_CJB110} AaCAATAATC GAGGGTATGG ATGCAGATAA AGCAGAGAAA CGAAAAGAAG
msa178297.2{104_COH1} AaCAATAATC GAGGGTATGG ATGCAGATAA AGCAGAGAAA CGAAAAGAAG
msa178297.2{104_M732} AaCAATAATC GAGGGTATGG ATGCAGATAA AGCAGAGAAA CGAAAAGAAG
msa178297.2{104_A909} AaCAATAATC GAGGGTATGG ATGCAGATAA AGCAGAGAAA CGAAAAGAAG
msa178297.2{104_M781} AmCAATAATC GAGGGTATGG ATGCAGATAA AGCAGAGAAA CGAAAAGAAG
msa178297.2{104_JM9130013} AaCAATAATC GAGGGTATGG ATGCAGATAA AGCAGAGAAA CGAAAAGAAG
Consensus *_******** ********** ********** ********** **********



401 450
msa178297.2{104_090} TTTTGAATGC CCAATATCCA AAATCAGCTA TTTATGAGGA TACAAAAGAA
msa178297.2{104_18RS21} TTTTGAATGC CCAATATCCA AAATCAGCTA TTTATGAGGA TACAAAAGAA
msa178297.2{104_2603} TTTTGAATGC CCAATATCCA AAATCAGCTA TTTATGAGGA TACAAAAGAA
msa178297.2{104_CJB110} TTTTGAATGC CCAATATCCA AAATCAGCTA TTTATGAGGA TACAAAAGAA
msa178297.2{104_COH1} TTTTGAATGC CCAATATCCA AAATCAGCTA TTTATGAGGA TACAAAAGAA
msa178297.2{104_M732} TTTTGAATGC CCAATATCCA AAATCAGCTA TTTATGAGGA TACAAAAGAA
msa178297.2{104_A909} TTTTGAATGC CCAATATCCA AAATCAGCTA TTTATGAGGA TACAAAAGAA
msa178297.2{104_M781} TTTTGAATGC CCAATATCCA AAATCAGCTA TTTATGAGGA TACAAAAGAA
msa178297.2{104_JM9130013} TTTTGAATGC CCAATATCCA AAATCAGCTA TTTATGAGGA TACAAAAGAA
Consensus ********** ********** ********** ********** **********



451 500
msa178297.2{104_090} AATTACCCAT TAGTTAATGT AGAGGGTTCC AAAGTTGGTG AACAATACAA
msa178297.2{104_18RS21} AATTACCCAT TAGTTAATGT AGAGGGTTCC AAAGTTGGTG AACAATACAA
msa178297.2{104_2603} AATTACCCAT TAGTTAATGT AGAGGGTTCC AAAGTTGGTG AACAATACAA
msa178297.2{104_CJB110} AATTACCCAT TAGTTAATGT AGAGGGTTCC AAAGTTGGTG AACAATACAA
msa178297.2{104_COH1} AATTACCCAT TAGTTAATGT AGAGGGTTCC AAAGTTGGTG AACAATACAA
msa178297.2{104_M732} AATTACCCAT TAGTTAATGT AGAGGGTTCC AAAGTTGGTG AACAATACAA
msa178297.2{104_A909} AATTACCCAT TAGTTAATGT AGAGGGTTCC AAAGTTGGTG AACAATACAA
msa178297.2{104_M781} AATTACCCAT TAGTTAATGT AGAGGGTTCC AAAGTTGGTG AACAATACAA
msa178297.2{104_JM9130013} AATTACCCAT TAGTTAATGT AGAGGGTTCC AAAGTTGGTG AACAATACAA
Consensus ********** ********** ********** ********** **********



501 550
msa178297.2{104_090} AGCATTGAAT CCAATAAATG GAAAAGATGG TCGAAGAGAG ATTGCTGAAG
msa178297.2{104_18RS21} AGCATTGAAT CCAATAAATG GAAAAGATGG TCGAAGAGAG ATTGCTGAAG
msa178297.2{104_2603} AGCATTGAAT CCAATAAATG GAAAAGATGG TCGAAGAGAG ATTGCTGAAG
msa178297.2{104_CJB110} AGCATTGAAT CCAATAAATG GAAAAGATGG TCGAAGAGAG ATTGCTGAAG
msa178297.2{104_COH1} AGCATTGAAT CCAATAAATG GAAAAGATGG TCGAAGAGAG ATTGCTGAAG
msa178297.2{104_M732} AGCATTGAAT CCAATAAATG GAAAAGATGG TCGAAGAGAG ATTGCTGAAG
msa178297.2{104_A909} AGCATTGAAT CCAATAAATG GAAAAGATGG TCGAAGAGAG ATTGCTGAAG
msa178297.2{104_M781} AGCATTGAAT CCAATAAATG GAAAAGATGG TCGAAGAGAG ATTGCTGAAG
msa178297.2{104_JM9130013} AGCATTGAAT CCAATAAATG GAAAAGATGG TCGAAGAGAG ATTGCTGAAG
Consensus ********** ********** ********** ********** **********



551 600
msa178297.2{104_090} GTTGGTTATC AAAAAAAAtT ACAGGGGTCA ATGATCTCGA TAAGAATAAA
msa178297.2{104_18RS21} GTTGGTTATC AAAAAAAAtT ACAGGGGTCA ATGATCTCGA TAAGAATAAA
msa178297.2{104_2603} GTTGGTTATC AAAAAAAAtT ACAGGGGTCA ATGATCTCGA TAAGAATAAA
msa178297.2{104_CJB110} GTTGGTTATC AAAAAAAAtT ACAGGGGTCA ATGATCTCGA TAAGAATAAA
msa178297.2{104_COH1} GTTGGTTATC AAAAAAAAaT ACAGGGGTCA ATGATCTCGA TAAGAATAAA
msa178297.2{104_M732} GTTGGTTATC AAAAAAAAaT ACAGGGGTCA ATGATCTCGA TAAGAATAAA
msa178297.2{104_A909} GTTGGTTATC AAAAAAAAtT ACAGGGGTCA ATGATCTCGA TAAGAATAAA
msa178297.2{104_M781} GTTGGTTATC AAAAAAAAtT ACAGGGGTCA ATGATCTCGA TAAGAATAAA
msa178297.2{104_JM9130013} GTTGGTTATC AAAAAAAAtT ACAGGGGTCA ATGATCTCGA TAAGAATAAA
Consensus ********** ********_* ********** ********** **********



601 650
msa178297.2{104_090} TATAAAATTG AATTAACTGT TGAGGGTAAA ACCACTGTTG AAACGAAAGA
msa178297.2{104_18RS21} TATAAAATTG AATTAACTGT TGAGGGTAAA ACCACTGTTG AAACGAAAGA
msa178297.2{104_2603} TATAAAATTG AATTAACTGT TGAGGGTAAA ACCACTGTTG AAACGAAAGA
msa178297.2{104_CJB110} TATAAAATTG AATTAACTGT TGAGGGTAAA ACCACTGTTG AAACGAAAGA
msa178297.2{104_COH1} TATAAAATTG AATTAACTGT TGAGGGTAAA ACCACTGTTG AAACGAAAGA
msa178297.2{104_M732} TATAAAATTG AATTAACTGT TGAGGGTAAA ACCACTGTTG AAACGAAAGA
msa178297.2{104_A909} TATAAAATTG AATTAACTGT TGAGGGTAAA ACCACTGTTG AAACGAAAGA
msa178297.2{104_M781} TATAAAATTG AATTAACTGT TGAGGGTAAA ACCACTGTTG AAACGAAAGA
msa178297.2{104_JM9130013} TATAAAATTG AATTAACTGT TGAGGGTAAA ACCACTGTTG AAACGAAAGA
Consensus ********** ********** ********** ********** **********



651 700
msa178297.2{104_090} ACTTAATCAA CCACTAGATG TCGTTGTGCT ATTAGATAAT TCAAATAGTA
msa178297.2{104_18RS21} ACTTAATCAA CCACTAGATG TCGTTGTGCT ATTAGATAAT TCAAATAGTA
msa178297.2{104_2603} ACTTAATCAA CCACTAGATG TCGTTGTGCT ATTAGATAAT TCAAATAGTA
msa178297.2{104_CJB110} ACTTAATCAA CCACTAGATG TCGTTGTGCT ATTAGATAAT TCAAATAGTA
msa178297.2{104_COH1} ACTTAATCAA CCACTAGATG TCGTTGTGCT ATTAGATAAT TCAAATAGTA
msa178297.2{104_M732} ACTTAATCAA CCACTAGATG TCGTTGTGCT ATTAGATAAT TCAAATAGTA
msa178297.2{104_A909} ACTTAATCAA CCACTAGATG TCGTTGTGCT ATTAGATAAT TCAAATAGTA
msa178297.2{104_M781} ACTTAATCAA CCACTAGATG TCGTTGTGCT ATTAGATAAT TCAAATAGTA
msa178297.2{104_JM9130013} ACTTAATCAA CCACTAGATG TCGTTGTGCT ATTAGATAAT TCAAATAGTA
Consensus ********** ********** ********** ********** **********



701 750
msa178297.2{104_090} TGAATAATGA AAGAGCCAAT AATTCTCAAA GAGCATTAAA AGCTGGGGAA
msa178297.2{104_18RS21} TGAATAATGA AAGAGCCAAT AATTCTCAAA GAGCATTAAA AGCTGGGGAA
msa178297.2{104_2603} TGAATAATGA AAGAGCCAAT AATTCTCAAA GAGCATTAAA AGCTGGGGAA
msa178297.2{104_CJB110} TGAATAATGA AAGAGCCAAT AATTCTCAAA GAGCATTAAA AGCTGGGGAA
msa178297.2{104_COH1} TGAATAATGA AAGAGCCAAT AATTCTCAAA GAGCATTAAA AGCTGGGGAA
msa178297.2{104_M732} TGAATAATGA AAGAGCCAAT AATTCTCAAA GAGCATTAAA AGCTGGGGAA
msa178297.2{104_A909} TGAATAATGA AAGAGCCAAT AATTCTCAAA GAGCATTAAA AGCTGGGGAA
msa178297.2{104_M781} TGAATAATGA AAGAGCCAAT AATTCTCAAA GAGCATTAAA AGCTGGGGAA
msa178297.2{104_JM9130013} TGAATAATGA AAGAGCCAAT AATTCTCAAA GAGCATTAAA AGCTGGGGAA
Consensus ********** ********** ********** ********** **********



751 800
msa178297.2{104_090} GCAGTTGAAA AGCTGATTGA TAAAATTACA TCAAATAAAG ACAATAGAGT
msa178297.2{104_18RS21} GCAGTTGAAA AGCTGATTGA TAAAATTACA TCAAATAAAG ACAATAGAGT
msa178297.2{104_2603} GCAGTTGAAA AGCTGATTGA TAAAATTACA TCAAATAAAG ACAATAGAGT
msa178297.2{104_CJB110} GCAGTTGAAA AGCTGATTGA TAAAATTACA TCAAATAAAG ACAATAGAGT
msa178297.2{104_COH1} GCAGTTGAAA AGCTGATTGA TAAAATTACA TCAAATAAAG ACAATAGAGT
msa178297.2{104_M732} GCAGTTGAAA AGCTGATTGA TAAAATTACA TCAAATAAAG ACAATAGAGT
msa178297.2{104_A909} GCAGTTGAAA AGCTGATTGA TAAAATTACA TCAAATAAAG ACAATAGAGT
msa178297.2{104_M781} GCAGTTGAAA AGCTGATTGA TAAAATTACA TCAAATAAAG ACAATAGAGT
msa178297.2{104_JM9130013} GCAGTTGAAA AGCTGATTGA TAAAATTACA TCAAATAAAG ACAATAGAGT
Consensus ********** ********** ********** ********** **********



801 850
msa178297.2{104_090} AGCTCTTGTG ACATATGCCT CAACCATTTT TGATGGTACT GAAGCGACCG
msa178297.2{104_18RS21} AGCTCTTGTG ACATATGCCT CAACCATTTT TGATGGTACT GAAGCGACCG
msa178297.2{104_2603} AGCTCTTGTG ACATATGCCT CAACCATTTT TGATGGTACT GAAGCGACCG
msa178297.2{104_CJB110} AGCTCTTGTG ACATATGCCT CAACCATTTT TGATGGTACT GAAGCGACCG
msa178297.2{104_COH1} AGCTCTTGTG ACATATGCCT CAACCATTTT TGATGGTACT GAAGCGACCG
msa178297.2{104_M732} AGCTCTTGTG ACATATGCCT CAACCATTTT TGATGGTACT GAAGCGACCG
msa178297.2{104_A909} AGCTCTTGTG ACATATGCCT CAACCATTTT TGATGGTACT GAAGCGACCG
msa178297.2{104_M781} AGCTCTTGTG ACATATGCCT CAACCATTTT TGATGGTACT GAAGCGACCG
msa178297.2{104_JM9130013} AGCTCTTGTG ACATATGCCT CAACCATTTT TGATGGTACT GAAGCGACCG
Consensus ********** ********** ********** ********** **********



851 900
msa178297.2{104_090} TATCAAAGGG AGTTGCCGAT CAAAATGGTA AAGCGCTGAA TGATAGTGTA
msa178297.2{104_18RS21} TATCAAAGGG AGTTGCCGAT CAAAATGGTA AAGCGCTGAA TGATAGTGTA
msa178297.2{104_2603} TATCAAAGGG AGTTGCCGAT CAAAATGGTA AAGCGCTGAA TGATAGTGTA
msa178297.2{104_CJB110} TATCAAAGGG AGTTGCCGAT CAAAATGGTA AAGCGCTGAA TGATAGTGTA
msa178297.2{104_COH1} TATCAAAGGG AGTTGCCGAT CAAAATGGTA AAGCGCTGAA TGATAGTGTA
msa178297.2{104_M732} TATCAAAGGG AGTTGCCGAT CAAAATGGTA AAGCGCTGAA TGATAGTGTA
msa178297.2{104_A909} TATCAAAGGG AGTTGCCGAT CAAAATGGTA AAGCGCTGAA TGATAGTGTA
msa178297.2{104_M781} TATCAAAGGG AGTTGCCGAT CAAAATGGTA AAGCGCTGAA TGATAGTGTA
msa178297.2{104_JM9130013} TATCAAAGGG AGTTGCCGAT CAAAATGGTA AAGCGCTGAA TGATAGTGTA
Consensus ********** ********** ********** ********** **********



901 950
msa178297.2{104_090} TCATGGGATT ATCATAAAAC TACTTTTACA GCAACTACAC ATAATTACAG
msa178297.2{104_18RS21} TCATGGGATT ATCATAAAAC TACTTTTACA GCAACTACAC ATAATTACAG
msa178297.2{104_2603} TCATGGGATT ATCATAAAAC TACTTTTACA GCAACTACAC ATAATTACAG
msa178297.2{104_CJB110} TCATGGGATT ATCATAAAAC TACTTTTACA GCAACTACAC ATAATTACAG
msa178297.2{104_COH1} TCATGGGATT ATCATAAAAC TACTTTTACA GCAACTACAC ATAATTACAG
msa178297.2{104_M732} TCATGGGATT ATCATAAAAC TACTTTTACA GCAACTACAC ATAATTACAG
msa178297.2{104_A909} TCATGGGATT ATCATAAAAC TACTTTTACA GCAACTACAC ATAATTACAG
msa178297.2{104_M781} TCATGGGATT ATCATAAAAC TACTTTTACA GCAACTACAC ATAATTACAG
msa178297.2{104_JM9130013} TCATGGGATT ATCATAAAAC TACTTTTACA GCAACTACAC ATAATTACAG
Consensus ********** ********** ********** ********** **********



951 1000
msa178297.2{104_090} TTATTTAAAT TTAACAAATG ATGCTAACGA AGTTAATATT CTAAAGTCAA
msa178297.2{104_18RS21} TTATTTAAAT TTAACAAATG ATGCTAACGA AGTTAATATT CTAAAGTCAA
msa178297.2{104_2603} TTATTTAAAT TTAACAAATG ATGCTAACGA AGTTAATATT CTAAAGTCAA
msa178297.2{104_CJB110} TTATTTAAAT TTAACAAATG ATGCTAACGA AGTTAATATT CTAAAGTCAA
msa178297.2{104_COH1} TTATTTAAAT TTAACAAATG ATGCTAACGA AGTTAATATT CTAAAGTCAA
msa178297.2{104_M732} TTATTTAAAT TTAACAAATG ATGCTAACGA AGTTAATATT CTAAAGTCAA
msa178297.2{104_A909} TTATTTAAAT TTAACAAATG ATGCTAACGA AGTTAATATT CTAAAGTCAA
msa178297.2{104_M781} TTATTTAAAT TTAACAAATG ATGCTAACGA AGTTAATATT CTAAAGTCAA
msa178297.2{104_JM9130013} TTATTTAAAT TTAACAAATG ATGCTAACGA AGTTAATATT CTAAAGTCAA
Consensus ********** ********** ********** ********** **********



1001 1050
msa178297.2{104_090} GAATTCCAAA GGAAGCGGAG CATATAAATG GGGATCGCAC GCTCTATCAA
msa178297.2{104_18RS21} GAATTCCAAA GGAAGCGGAG CATATAAATG GGGATCGCAC GCTCTATCAA
msa178297.2{104_2603} GAATTCCAAA GGAAGCGGAG CATATAAATG GGGATCGCAC GCTCTATCAA
msa178297.2{104_CJB110} GAATTCCAAA GGAAGCGGAG CATATAAATG GGGATCGCAC GCTCTATCAA
msa178297.2{104_COH1} GAATTCCAAA GGAAGCGGAG CATATAAATG GGGATCGCAC GCTCTATCAA
msa178297.2{104_M732} GAATTCCAAA GGAAGCGGAG CATATAAATG GGGATCGCAC GCTCTATCAA




WO 2004/018646 



Table 41: Comparative Sequences relating to SAG0649 



msa178297.2{104_A909} GAATTCCAAA GGAAGCGGAG CATATAAATG GGGATCGCAC GCTCTATCAA
msa178297.2{104_M781} GAATTCCAAA GGAAGCGGAG CATATAAATG GGGATCGCAC GCTCTATCAA
msa178297.2{104_JM9130013} GAATTCCAAA GGAAGCGGAG CATATAAATG GGGATCGCAC GCTCTATCAA
Consensus ********** ********** ********** ********** **********



1051 1100
msa178297.2{104_090} TTTGGTGCGA CATTTACTCA AAAAGCTCTA ATGAAAGCAA ATGAAATTTT
msa178297.2{104_18RS21} TTTGGTGCGA CATTTACTCA AAAAGCTCTA ATGAAAGCAA ATGAAATTTT
msa178297.2{104_2603} TTTGGTGCGA CATTTACTCA AAAAGCTCTA ATGAAAGCAA ATGAAATTTT
msa178297.2{104_CJB110} TTTGGTGCGA CATTTACTCA AAAAGCTCTA ATGAAAGCAA ATGAAATTTT
msa178297.2{104_COH1} TTTGGTGCGA CATTTACTCA AAAAGCTCTA ATGAAAGCAA ATGAAATTTT
msa178297.2{104_M732} TTTGGTGCGA CATTTACTCA AAAAGCTCTA ATGAAAGCAA ATGAAATTTT
msa178297.2{104_A909} TTTGGTGCGA CATTTACTCA AAAAGCTCTA ATGAAAGCAA ATGAAATTTT
msa178297.2{104_M781} TTTGGTGCGA CATTTACTCA AAAAGCTCTA ATGAAAGCAA ATGAAATTTT
msa178297.2{104_JM9130013} TTTGGTGCGA CATTTACTCA AAAAGCTCTA ATGAAAGCAA ATGAAATTTT
Consensus ********** ********** ********** ********** **********



1101 1150
msa178297.2{104_090} AGAGACACAA AGTTCTAATG CTAGAAAAAA ACTTATTTTT CACGTAACTG
msa178297.2{104_18RS21} AGAGACACAA AGTTCTAATG CTAGAAAAAA ACTTATTTTT CACGTAACTG
msa178297.2{104_2603} AGAGACACAA AGTTCTAATG CTAGAAAAAA ACTTATTTTT CACGTAACTG
msa178297.2{104_CJB110} AGAGACACAA AGTTCTAATG CTAGAAAAAA ACTTATTTTT CACGTAACTG
msa178297.2{104_COH1} AGAGACACAA AGTTCTAATG CTAGAAAAAA ACTTATTTTT CACGTAACTG
msa178297.2{104_M732} AGAGACACAA AGTTCTAATG CTAGAAAAAA ACTTATTTTT CACGTAACTG
msa178297.2{104_A909} AGAGACACAA AGTTCTAATG CTAGAAAAAA ACTTATTTTT CACGTAACTG
msa178297.2{104_M781} AGAGACACAA AGTTCTAATG CTAGAAAAAA ACTTATTTTT CACGTAACTG
msa178297.2{104_JM9130013} AGAGACACAA AGTTCTAATG CTAGAAAAAA ACTTATTTTT CACGTAACTG
Consensus ********** ********** ********** ********** **********



1151 1200
msa178297.2{104_090} ATGGTGTCCC TACGATGTCT TATGCCATAA ATTTTAATCC TTATATATCA
msa178297.2{104_18RS21} ATGGTGTCCC TACGATGTCT TATGCCATAA ATTTTAATCC TTATATATCA
msa178297.2{104_2603} ATGGTGTCCC TACGATGTCT TATGCCATAA ATTTTAATCC TTATATATCA
msa178297.2{104_CJB110} ATGGTGTCCC TACGATGTCT TATGCCATAA ATTTTAATCC TTATATATCA
msa178297.2{104_COH1} ATGGTGTCCC TACGATGTCT TATGCCATAA ATTTTAATCC TTATATATCA
msa178297.2{104_M732} ATGGTGTCCC TACGATGTCT TATGCCATAA ATTTTAATCC TTATATATCA
msa178297.2{104_A909} ATGGTGTCCC TACGATGTCT TATGCCATAA ATTTTAATCC TTATATATCA
msa178297.2{104_M781} ATGGTGTCCC TACGATGTCT TATGCCATAA ATTTTAATCC TTATATATCA
msa178297.2{104_JM9130013} ATGGTGTCCC TACGATGTCT TATGCCATAA ATTTTAATCC TTATATATCA
Consensus ********** ********** ********** ********** **********



1201 1250
msa178297.2{104_090} ACATCTTACC AAAACCAGTT TAATTCTTTT TTAAATAAAA TACCAGATAG
msa178297.2{104_18RS21} ACATCTTACC AAAACCAGTT TAATTCTTTT TTAAATAAAA TACCAGATAG
msa178297.2{104_2603} ACATCTTACC AAAACCAGTT TAATTCTTTT TTAAATAAAA TACCAGATAG
msa178297.2{104_CJB110} ACATCTTACC AAAACCAGTT TAATTCTTTT TTAAATAAAA TACCAGATAG
msa178297.2{104_COH1} ACATCTTACC AAAACCAGTT TAATTCTTTT TTAAATAAAA TACCAGATAG
msa178297.2{104_M732} ACATCTTACC AAAACCAGTT TAATTCTTTT TTAAATAAAA TACCAGATAG
msa178297.2{104_A909} ACATCTTACC AAAACCAGTT TAATTCTTTT TTAAATAAAA TACCAGATAG
msa178297.2{104_M781} ACATCTTACC AAAACCAGTT TAATTCTTTT TTAAATAAAA TACCAGATAG
msa178297.2{104_JM9130013} ACATCTTACC AAAACCAGTT TAATTCTTTT TTAAATAAAA TACCAGATAG
Consensus ********** ********** ********** ********** **********



1251 1300
msa178297.2{104_090} AAGTGGTATT CTCCAAGAGG ATTTTATAAT CAATGGTGAT GATTATCAAA
msa178297.2{104_18RS21} AAGTGGTATT CTCCAAGAGG ATTTTATAAT CAATGGTGAT GATTATCAAA
msa178297.2{104_2603} AAGTGGTATT CTCCAAGAGG ATTTTATAAT CAATGGTGAT GATTATCAAA
msa178297.2{104_CJB110} AAGTGGTATT CTCCAAGAGG ATTTTATAAT CAATGGTGAT GATTATCAAA
msa178297.2{104_COH1} AAGTGGTATT CTCCAAGAGG ATTTTATAAT CAATGGTGAT GATTATCAAA
msa178297.2{104_M732} AAGTGGTATT CTCCAAGAGG ATTTTATAAT CAATGGTGAT GATTATCAAA
msa178297.2{104_A909} AAGTGGTATT CTCCAAGAGG ATTTTATAAT CAATGGTGAT GATTATCAAA
msa178297.2{104_M781} AAGTGGTATT CTCCAAGAGG ATTTTATAAT CAATGGTGAT GATTATCAAA
msa178297.2{104_JM9130013} AAGTGGTATT CTCCAAGAGG ATTTTATAAT CAATGGTGAT GATTATCAAA
Consensus ********** ********** ********** ********** **********



1301 1350
msa178297.2{104_090} TAGTAAAAGG AGATGGAGAG AGTTTTAAAC TGTTTTCGGA TAGAAAAGTT
msa178297.2{104_18RS21} TAGTAAAAGG AGATGGAGAG AGTTTTAAAC TGTTTTCGGA TAGAAAAGTT
msa178297.2{104_2603} TAGTAAAAGG AGATGGAGAG AGTTTTAAAC TGTTTTCGGA TAGAAAAGTT
msa178297.2{104_CJB110} TAGTAAAAGG AGATGGAGAG AGTTTTAAAC TGTTTTCGGA TAGAAAAGTT
msa178297.2{104_COH1} TAGTAAAAGG AGATGGAGAG AGTTTTAAAC TGTTTTCGGA TAGAAAAGTT
msa178297.2{104_M732} TAGTAAAAGG AGATGGAGAG AGTTTTAAAC TGTTTTCGGA TAGAAAAGTT
msa178297.2{104_A909} TAGTAAAAGG AGATGGAGAG AGTTTTAAAC TGTTTTCGGA TAGAAAAGTT
msa178297.2{104_M781} TAGTAAAAGG AGATGGAGAG AGTTTTAAAC TGTTTTCGGA TAGAAAAGTT
msa178297.2{104_JM9130013} TAGTAAAAGG AGATGGAGAG AGTTTTAAAC TGTTTTCGGA TAGAAAAGTT
Consensus ********** ********** ********** ********** **********



1351 1400
msa178297.2{104_090} CCTGTTACTG GAGGAACGAC ACAAGCAGCT TATCGAGTAC CGCAAAATCA
msa178297.2{104_18RS21} CCTGTTACTG GAGGAACGAC ACAAGCAGCT TATCGAGTAC CGCAAAATCA
msa178297.2{104_2603} CCTGTTACTG GAGGAACGAC ACAAGCAGCT TATCGAGTAC CGCAAAATCA
msa178297.2{104_CJB110} CCTGTTACTG GAGGAACGAC ACAAGCAGCT TATCGAGTAC CGCAAAATCA
msa178297.2{104_COH1} CCTGTTACTG GAGGAACGAC ACAAGCAGCT TATCGAGTAC CGCAAAATCA




PCT/US2003/026827 






msa178297.2{104_M732} CCTGTTACTG GAGGAACGAC ACAAGCAGCT TATCGAGTAC CGCAAAATCA
msa178297.2{104_A909} CCTGTTACTG GAGGAACGAC ACAAGCAGCT TATCGAGTAC CGCAAAATCA
msa178297.2{104_M781} CCTGTTACTG GAGGAACGAC ACAAGCAGCT TATCGAGTAC CGCAAAATCA
msa178297.2{104_JM9130013} CCTGTTACTG GAGGAACGAC ACAAGCAGCT TATCGAGTAC CGCAAAATCA
Consensus ********** ********** ********** ********** **********



msa178297.2{104_090}
msa178297.2{104_18RS21}
msa178297.2{104_2603}
msa178297.2{104_CJB110}
msa178297.2{104_COH1}
msa178297.2{104_M732}
msa178297.2{104_A909}
msa178297.2{104_M781}
msa178297.2{104_JM9130013}
Consensus



msa178297.2{104_090}
msa178297.2{104_18RS21}
msa178297.2{104_2603}
msa178297.2{104_CJB110}
msa178297.2{104_COH1}
msa178297.2{104_M732}
msa178297.2{104_A909}
msa178297.2{104_M781}
msa178297.2{104_JM9130013}
Consensus



msa178297.2{104_090}
msa178297.2{104_18RS21}
msa178297.2{104_2603}
msa178297.2{104_CJB110}
msa178297.2{104_COH1}
msa178297.2{104_M732}
msa178297.2{104_A909}
msa178297.2{104_M781}
msa178297.2{104_JM9130013}
Consensus



msal78297 
tnsal78297.2{ 
msal78297. 
msal78297.2{ 
msal78297 . 
msal78297. 
msal78297. 
msal78297 
msal78297.2{l04 



2{l04_090} 
104_18RS2l} 
2{104_2603} 
104_CJB110> 
2{104_COH1} 
2{104_M732} 
2{104_A909} 
2{104_M781} 
JM9130013} 
Consensus 



msal78297 . 2 { 104_090 } 
msal78297.2{l04_18RS2l} 
msal78297 . 2 { 104_2603 } 
msal78297.2{l04_CJB110) 
msal78297 . 2 { 104_COH1} 
msal78297 . 2 { 104_M732 } 
msal78297 . 2 { 104_A909 } 
msal78297 . 2 { 104_M78 1 } 
msal78297.2{l04_JM9130013} 
Consensus 



msal78297.2{l04_090} 
msal78297 . 2 { 104_18RS2l} 
msal78297 .2{104_2603} 
msal78297 . 2 { 104_CJB110 } 
msa!78297 . 2 ( 104__COHl} 
msal78297 . 2 { 104_M732 } 
msal782 97 . 2f 104__A909 } 
msal78297.2{104_M781> 
msal78297 . 2 { 104_JM9130013 } 
Consensus 



1401 

ACTCTCTGTA 
ACTCTCTGTA 
ACTCTCTGTA 
ACTCTCTGTA 
ACTCTCTGTA 
ACTCTCTGTA 
ACTCTCTGTA 
ACTCTCTGTA 
ACTCTCTGTA 
********** 

1451 

ATCTCTATTG 
ATCTCTATTG 
ATCTCTATTG 
ATCTCTATTG 
ATCTCTATTG 
ATCTCTATTG 
ATCTCTATTG 
ATCTCTATTG 
ATCTCTATTG 
********** 

1501 

AAGAAAGTTT 
AAGAAAGTTT 
AAGAAAGTTT 
AAGAAAGTTT 
AAGAAAGTTT 
AAGAAAGTTT 
AAGAAAGTTT 
AAGAAAGTTT 
AAGAAAGTTT 
********** 

1551 

ATTATACTTT 
ATTATACTTT 
ATTATACTTT 
ATTATACTTT 
ATTATACTTT 
ATTATACTTT 
ATTATACTTT 
ATTATACTTT 
ATTATACTTT 
********** 

1601 

TTGGGATTGG 
TTGGGATTGG 
TTGGGATTGG 
TTGGGATTGG 
TTGGGATTGG 
TTGGGATTGG 
TTGGGATTGG 
TTGGGATTGG 
TTGGGATTGG 
********** 

1651 

AAATTTATGC 
AAATTTATGC 
AAATTTATGC 
AAATTTATGC 
AAATTTATGC 
AAATTTATGC 
AAATTTATGC 
AAATTTATGC 
AAATTTATGC 
********** 



ATGAGTAATG 
ATGAGTAATG 
ATGAGTAATG 
ATGAGTAATG 
ATGAGTAATG 
ATGAGTAATG 
ATGAGTAATG 
ATGAGTAATG 
ATGAGTAATG 
********** 



GAGAGATTAC 
GAGAGATTAC 
GAGAGATTAC 
GAGAGATTAC 
GAGAGATTAC 
GAGAGATTAC 
GAGAGATTAC 
GAGAGATTAC 
GAGAGATTAC 
********** 



AGGGATATGC 
AGGGATATGC 
AGGGATATGC 
AGGGATATGC 
AGGGATATGC 
AGGGATATGC 
AGGGATATGC 
AGGGATATGC 
AGGGATATGC 
********** 



AACTGGGTCT 
AACTGGGTCT 
AACTGGGTCT 
AACTGGGTCT 
AACTGGGTCT 
AACTGGGTCT 
AACTGGGTCT 
AACTGGGTCT 
AACTGGGTCT 
********** 



CTGCAACGAA 
CTGCAACGAA 
CTGCAACGAA 
CTGCAACGAA 
CTGCAACGAA 
CTGCAACGAA 
CTGCAACGAA 
CTGCAACGAA 
CTGCAACGAA 
********** 



AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
********** 



TGTAAACGGA 
TGTAAACGGA 
TGTAAACGGA 
TGTAAACGGA 
TGTAAACGGA 
TGTAAACGGA 
TGTAAACGGA 
TGTAAACGGA 
TGTAAACGGA 
********** 



ACAAATCAAA 
ACAAATCAAA 
ACAAATCAAA 
ACAAATCAAA 
ACAAATCAAA 
ACAAATCAAA 
ACAAATCAAA 
ACAAATCAAA 
ACAAATCAAA 
********** 



AATTAATAGT 
AATTAATAGT 
AATTAATAGT 
AATTAATAGT 
AATTAATAGT 
AATTAATAGT 
AATTAATAGT 
AATTAATAGT 
AATTAATAGT 
********** 



ATCCATTTGA 
ATCCATTTGA 
ATCCATTTGA 
ATCCATTTGA 
ATCCATTTGA 
ATCCATTTGA 
ATCCATTTGA 
ATCCATTTGA 
ATCCATTTGA 
********** 



ACTCATGGTG
ACTCATGGTG
ACTCATGGTG
ACTCATGGTG 
ACTCATGGTG 
ACTCATGGTG 
ACTCATGGTG 
ACTCATGGTG 
ACTCATGGTG 
********** 



TAAGACCTAA 
TAAGACCTAA 
TAAGACCTAA 
TAAGACCTAA 
TAAGACCTAA 
TAAGACCTAA 
TAAGACCTAA 
TAAGACCTAA 
TAAGACCTAA 
********** 



GATCCTGGTG 
GATCCTGGTG 
GATCCTGGTG 
GATCCTGGTG 
GATCCTGGTG 
GATCCTGGTG 
GATCCTGGTG 
GATCCTGGTG 
GATCCTGGTG 
********** 



AGGTTATGAC 
AGGTTATGAC 
AGGTTATGAC 
AGGTTATGAC 
AGGTTATGAC 
AGGTTATGAC 
AGGTTATGAC 
AGGTTATGAC 
AGGTTATGAC 
********** 



CAACTCCTCT 
CAACTCCTCT 
CAACTCCTCT 
CAACTCCTCT 
CAACTCCTCT 
CAACTCCTCT 
CAACTCCTCT 
CAACTCCTCT 
CAACTCCTCT 
********** 



AAGTAAAACA 
AAGTAAAACA 
AAGTAAAACA 
AAGTAAAACA 
AAGTAAAACA 
AAGTAAAACA 
AAGTAAAACA 
AAGTAAAACA 
AAGTAAAACA 



GAAAATTATA 
GAAAATTATA 
GAAAATTATA 
GAAAATTATA 
GAAAATTATA 
GAAAATTATA 
GAAAATTATA 
GAAAATTATA 
GAAAATTATA 



AATCAATATC 
AATCAATATC 
AATCAATATC 
AATCAATATC 
AATCAATATC 
AATCAATATC 
AATCAATATC 
AATCAATATC 
AATCAATATC 

********** ********** ********** ********** 



1450 
GGATATATTT 
GGATATATTT 
GGATATATTT 
GGATATATTT 
GGATATATTT 
GGATATATTT 
GGATATATTT 
GGATATATTT 
GGATATATTT 
********** 

1500 
TCCTAAGACA 
TCCTAAGACA 
TCCTAAGACA 
TCCTAAGACA 
TCCTAAGACA 
TCCTAAGACA 
TCCTAAGACA 
TCCTAAGACA 
TCCTAAGACA 
********** 

1550 
AGCCAACAAC 
AGCCAACAAC 
AGCCAACAAC 
AGCCAACAAC 
AGCCAACAAC 
AGCCAACAAC 
AGCCAACAAC 
AGCCAACAAC 
AGCCAACAAC 
********** 

1600 
ATTTTTACTG 
ATTTTTACTG 
ATTTTTACTG 
ATTTTTACTG 
ATTTTTACTG 
ATTTTTACTG 
ATTTTTACTG 
ATTTTTACTG 
ATTTTTACTG 
********** 

1650 
TGAAGCTGAG 
TGAAGCTGAG 
TGAAGCTGAG 
TGAAGCTGAG 
TGAAGCTGAG 
TGAAGCTGAG 
TGAAGCTGAG 
TGAAGCTGAG 
TGAAGCTGAG 
********** 

1700 
CTAATGTTGA 
CTAATGTTGA 
CTAATGTTGA 
CTAATGTTGA 
CTAATGTTGA 
CTAATGTTGA 
CTAATGTTGA 
CTAATGTTGA 
CTAATGTTGA 



maal78297.2{!04_090} 
msal78 2 97 . 2 { 104_18RS2 1 } 
msal78297.2{104_2603} 
msal78297.2{l04_CJB110} 



1701 1750 
TGATACAAAT AAAATTTATG ATGAGCTAAA TAAATACTTT AAAACAATTG 
TGATACAAAT AAAATTTATG ATGAGCTAAA TAAATACTTT AAAACAATTG 
TGATACAAAT AAAATTTATG ATGAGCTAAA TAAATACTTT AAAACAATTG 
TGATACAAAT AAAATTTATG ATGAGCTAAA TAAATACTTT AAAACAATTG 
Table 41: Comparative Sequences relating to SAG0649



msal78297 . 2 ( 104_COH1 } 
msa!78297 . 2 { 104_M732 } 
msal78297 .2{l04_A909} 
msal78297 . 2{ 104_M78l} 
msal78297.2{l04_JM9130013} 
Consensus 



msal78297.2{l04_090} 
msal78297 . 2 { 104_18RS2l} 
msal78297.2{!04_2603} 
ms al 7 8 2 97 . 2 { 1 0 4_C JBl 1 0 } 
msal78297 . 2 { 104_COHl) 
msal78297 . 2{l04_M732} 
msal78297 .2{l04_A909} 
msal78297.2{l04_M78l} 
msal78297 . 2 { 104_JM9130013 } 
Consensus 



msal78297.2{l04_090} 
msal78297.2{l04__18RS2l} 
msal78297.2{l04_2603} 
msal78297 . 2 { 104_CJB110 } 
msal7 82 97 . 2 { 104 JIOH1 } 
msal78297 . 2 { 104_M732 } 
msal782 97.2{l04_A909} 
msal78297.2{l04_M78l} 
msal78297 . 2 { 104_JM9130013 } 
Consensus 



msal78297-2{l04_090} 
msal782 97 .2{104__18RS21} 
msal78297 . 2 { 104__2603 } 
msal78297 . 2 { 104_CJB110 > 
msal78297 . 2 { 104_COH1 } 
msal78297.2{l04_M732} 
msal78297.2{l04_A909} 
msal782 97 - 2 { 104_M78 1 } 
msal78297.2{l04_JM9130013} 
Consensus 



msal78297. 2 {l04_090} 
msal78297.2{l04_18RS2l} 
msal78297 . 2{ 104_2 603 } 
msal782 97 . 2 { 104_CJB110 } 
msal78297.2{l04_COHl} 
msa!78297 . 2 { 104_M732 } 
msal78297 .2{104_A909) 
msal78297.2{l04~M781j 
msal78297.2{l04_JM9130013} 
Consensus 



TGATACAAAT 
TGATACAAAT 
TGATACAAAT 
TGATACAAAT 
TGATACAAAT 
********** 

1751 

TTGAGGAAAA 
TTGAGGAAAA 
TTGAGGAAAA 
TTGAGGAAAA 
TTGAGGAAAA 
TTGAGGAAAA 
TTGAGGAAAA 
TTGAGGAAAA 
TTGAGGAAAA 
********** 

1801 

GAGATGATTG 
GAGATGATTG 
GAGATGATTG 
GAGATGATTG 
GAGATGATTG 
GAGATGATTG 
GAGATGATTG 
GAGATGATTG 
GAGATGATTG 
********** 

1851 

TTACGTTTTG 
TTACGTTTTG 
TTACGTTTTG 
TTACGTTTTG 
TTACGTTTTG 
TTACGTTTTG 
TTACGTTTTG 
TTACGTTTTG 
TTACGTTTTG 
********** 

1901 

TTGGTGGACC 
TTGGTGGACC 
TTGGTGGACC 
TTGGTGGACC 
TTGGTGGACC 
TTGGTGGACC 
TTGGTGGACC 
TTGGTGGACC 
TTGGTGGACC 



AAAATTTATG 
AAAATTTATG 
AAAATTTATG 
AAAATTTATG 
AAAATTTATG 
********** 



ACATTCTATT 
ACATTCTATT 
ACATTCTATT 
ACATTCTATT 
ACATTCTATT 
ACATTCTATT 
ACATTCTATT 
ACATTCTATT 
ACATTCTATT 
********** 



ATGAGCTAAA 
ATGAGCTAAA 
ATGAGCTAAA 
ATGAGCTAAA 
ATGAGCTAAA 
********** 



GTTGATGGAA 
GTTGATGGAA 
GTTGATGGAA 
GTTGATGGAA 
GTTGATGGAA 
GTTGATGGAA 
GTTGATGGAA 
GTTGATGGAA 
GTTGATGGAA 
********** 



AATTCCAATT 
AATTCCAATT 
AATTCCAATT 
AATTCCAATT 
AATTCCAATT 
AATTCCAATT 
AATTCCAATT 
AATTCCAATT 
AATTCCAATT 
********** 



GTTGGAAATG 
GTTGGAAATG 
GTTGGAAATG 
GTTGGAAATG 
GTTGGAAATG 
GTTGGAAATG 
GTTGGAAATG 
GTTGGAAATG 
GTTGGAAATG 
********** 



AAAAAATGGT 
AAAAAATGGT 
AAAAAATGGT 
AAAAAATGGT 
AAAAAATGGT 
AAAAAATGGT 
AAAAAATGGT 
AAAAAATGGT 
AAAAAATGGT 
********** 



TAAATACTTT 
TAAATACTTT 
TAAATACTTT 
TAAATACTTT 
TAAATACTTT 
********** 



ATGTGACTGA 
ATGTGACTGA 
ATGTGACTGA 
ATGTGACTGA 
ATGTGACTGA 
ATGTGACTGA 
ATGTGACTGA 
ATGTGACTGA 
ATGTGACTGA 
********** 



CAAAGTTTTA 
CAAAGTTTTA 
CAAAGTTTTA 
CAAAGTTTTA 
CAAAGTTTTA 
CAAAGTTTTA 
CAAAGTTTTA 
CAAAGTTTTA 
CAAAGTTTTA 
********** 



ATGGCAGTCA 
ATGGCAGTCA
ATGGCAGTCA 
ATGGCAGTCA 
ATGGCAGTCA 
ATGGCAGTCA 
ATGGCAGTCA 
ATGGCAGTCA 
ATGGCAGTCA 
********** 



ATTAAAAAAT 
ATTAAAAAAT 
ATTAAAAAAT 
ATTAAAAAAT 
ATTAAAAAAT 
ATTAAAAAAT 
ATTAAAAAAT 
ATTAAAAAAT 
ATTAAAAAAT 
********** 



AAACAGTGAT GGGGGAATTT TAAAAGATGT 
AAACAGTGAT GGGGGAATTT TAAAAGATGT 
AAACAGTGAT GGGGGAATTT TAAAAGATGT 
AAACAGTGAT GGGGGAATTT TAAAAGATGT 
AAACAGTGAT GGGGGAATTT TAAAAGATGT 
AAACAGTGAT GGGGGAATTT TAAAAGATGT 
AAACAGTGAT GGGGGAATTT TAAAAGATGT 
AAACAGTGAT GGGGGAATTT TAAAAGATGT 
AAACAGTGAT GGGGGAATTT TAAAAGATGT 
********** ********** ********** ********** 



AAAACAATTG 
AAAACAATTG 
AAAACAATTG 
AAAACAATTG 
AAAACAATTG 
********** 

1800 
TCCTATGGGA 
TCCTATGGGA 
TCCTATGGGA 
TCCTATGGGA 
TCCTATGGGA 
TCCTATGGGA 
TCCTATGGGA 
TCCTATGGGA 
TCCTATGGGA 
********** 

1850 
CACATGATGA 
CACATGATGA 
CACATGATGA 
CACATGATGA 
CACATGATGA 
CACATGATGA 
CACATGATGA 
CACATGATGA 
CACATGATGA 
********** 

1900 
GGTGTGGCTC 
GGTGTGGCTC 
GGTGTGGCTC 
GGTGTGGCTC 
GGTGTGGCTC 
GGTGTGGCTC 
GGTGTGGCTC 
GGTGTGGCTC 
GGTGTGGCTC 
********** 

1950 
TACAGTGACT 
TACAGTGACT 
TACAGTGACT 
TACAGTGACT 
TACAGTGACT 
TACAGTGACT 
TACAGTGACT 
TACAGTGACT 
TACAGTGACT 
********** 



msal78297 . 2 { 104_090 } 
msal78297 . 2 { 104_18RS21 } 
msal78297 .2{104_2603} 
msal78297.2{l04_CJB110} 
msal78297.2{l04_COHl) 
msal78297.2{l04_M732} 
msal78297 .2f 104_A909) 
msal78297 . 2 { 104_M78l} 
msal78297 . 2 { 104_JM9130013 } 
Consensus 



msal78297 .2{104_090) 
msal78297.2{l04_18RS2l) 
msal78297 . 2 { 104_2603 } 
msal78297 . 2 { 104_CJB110} 
msal78297 .2{104_COH1) 
msal78297.2(l04_M732) 
msal78297.2{l04_A909} 
msal78297.2{l04_M78l} 
msal78297 . 2 { 104_JM9130013 } 
Consensus 



msal78297.2{!04_090} 
msal78297 . 2 { 104_18RS2l) 
msal78297 . 2 { 104_2603 } 



1951 

TATGATAAGA 
TATGATAAGA 
TATGATAAGA 
TATGATAAGA 
TATGATAAGA 
TATGATAAGA 
TATGATAAGA 
TATGATAAGA 
TATGATAAGA 
********** 

2001 

TGGACAAAAA 
TGGACAAAAA 
TGGACAAAAA 
TGGACAAAAA 
TGGACAAAAA 
TGGACAAAAA 
TGGACAAAAA 
TGGACAAAAA 
TGGACAAAAA 
********** 



CATCTCAAAC 
CATCTCAAAC 
CATCTCAAAC 
CATCTCAAAC 
CATCTCAAAC 
CATCTCAAAC 
CATCTCAAAC 
CATCTCAAAC 
CATCTCAAAC 
********** 



CATCAAAATC 
CATCAAAATC 
CATCAAAATC 
CATCAAAATC 
CATCAAAATC 
CATCAAAATC 
CATCAAAATC 
CATCAAAATC 
CATCAAAATC 



AATCATTTGA 
AATCATTTGA 
AATCATTTGA 
AATCATTTGA 
AATCATTTGA 
AATCATTTGA 
AATCATTTGA 
AATCATTTGA 
AATCATTTGA 



2000 
ACTTAGGAAG 
ACTTAGGAAG 
ACTTAGGAAG 
ACTTAGGAAG 
ACTTAGGAAG 
ACTTAGGAAG 
ACTTAGGAAG 
ACTTAGGAAG 
ACTTAGGAAG 



********** ********** ********** 



GTAGTTCTTA 
GTAGTTCTTA 
GTAGTTCTTA 
GTAGTTCTTA 
GTAGTTCTTA 
GTAGTTCTTA 
GTAGTTCTTA 
GTAGTTCTTA 
GTAGTTCTTA 
********** 



CCTATGATGT 
CCTATGATGT 
CCTATGATGT 
CCTATGATGT 
CCTATGATGT 
CCTATGATGT 
CCTATGATGT 
CCTATGATGT 
CCTATGATGT 
********** 



ACGTTTAAAA 
ACGTTTAAAA 
ACGTTTAAAA 
ACGTTTAAAA 
ACGTTTAAAA 
ACGTTTAAAA 
ACGTTTAAAA 
ACGTTTAAAA 
ACGTTTAAAA 
********** 



2050 
GATAACTATA 
GATAACTATA 
GATAACTATA 
GATAACTATA 
GATAACTATA 
GATAACTATA 
GATAACTATA 
GATAACTATA 
GATAACTATA 
********** 



2051 2100 
TAAGTAACAA ATTTTACAAT ACAAATAATC GTACAACGCT AAGTCCGAAG
TAAGTAACAA ATTTTACAAT ACAAATAATC GTACAACGCT AAGTCCGAAG
TAAGTAACAA ATTTTACAAT ACAAATAATC GTACAACGCT AAGTCCGAAG 



msal78297 . 2 { 104__CJB110 } 
msal782 97 . 2 { 104_COH1 } 
msal78297 . 2 { 104_M732 } 
msal78297 .2{l04_A909} 
msal78297.2{l04_M78l} 
msal78297 . 2 { 104_JM9130013 } 
Consensus 



TAAGTAACAA 
TAAGTAACAA 
TAAGTAACAA 
TAAGTAACAA 
TAAGTAACAA 
TAAGTAACAA 



ATTTTACAAT 
ATTTTACAAT 
ATTTTACAAT 
ATTTTACAAT 
ATTTTACAAT 
ATTTTACAAT 



ACAAATAATC 
ACAAATAATC 
ACAAATAATC 
ACAAATAATC 
ACAAATAATC 
ACAAATAATC 



GTACAACGCT 
GTACAACGCT 
GTACAACGCT 
GTACAACGCT 
GTACAACGCT 
GTACAACGCT 



AAGTCCGAAG 
AAGTCCGAAG 
AAGTCCGAAG 
AAGTCCGAAG 
AAGTCCGAAG 
AAGTCCGAAG 



********** ********** ********** ********** ********** 



msal78297 
msal78297.2{ 
msal78297 . 
msal78297.2{ 
msal78297 . 
Tnsal78297 . 
msal78297 . 
msal78297 . 
msal78297.2{l04 



2{104_090} 
104_18RS2l} 
2{104_2603} 
104_CJB110} 
2{104_COH1} 
2{104_M732} 
2{104_A909) 
2{l04_M78l} 
JM9130013} 
Consensus 



2101 

AGTGAAAAAG 
AGTGAAAAAG 
AGTGAAAAAG 
AGTGAAAAAG 
AGTGAAAAAG 
AGTGAAAAAG 
AGTGAAAAAG 
AGTGAAAAAG 
AGTGAAAAAG 



AACCAAATAC
AACCAAATAC 
AACCAAATAC 
AACCAAATAC 
AACCAAATAC 
AACCAAATAC 
AACCAAATAC 
AACCAAATAC 
AACCAAATAC 



TATTCGTGAT 
TATTCGTGAT 
TATTCGTGAT 
TATTCGTGAT 
TATTCGTGAT 
TATTCGTGAT 
TATTCGTGAT 
TATTCGTGAT 
TATTCGTGAT 



********** ********** ********** 



TTCCCAATTC 
TTCCCAATTC 
TTCCCAATTC 
TTCCCAATTC 
TTCCCAATTC 
TTCCCAATTC 
TTCCCAATTC 
TTCCCAATTC 
TTCCCAATTC 
********** 



2150 
CCAAAATTCG 
CCAAAATTCG 
CCAAAATTCG 
CCAAAATTCG 
CCAAAATTCG 
CCAAAATTCG 
CCAAAATTCG 
CCAAAATTCG 
CCAAAATTCG 
********** 



msal78297 
msal78297.2{ 
msal78297 . 
msal78297 .2{ 
msal78297 
msal78297 
msal78297 
msal78297 
msal78297.2{l04 



2{104_090} 
104_18RS2l} 
2{104_2603} 
104 CJB110} 
2 { 1 04_COH1 } 
2{104_M732} 
2(104_A909} 
2{104_M781} 
JM9130013} 
Consensus 



2151 

TGATGTTCGT 
TGATGTTCGT 
TGATGTTCGT 
TGATGTTCGT 
TGATGTTCGT 
TGATGTTCGT 
TGATGTTCGT 
TGATGTTCGT 
TGATGTTCGT 
********** 



GAGTTTCCGG 
GAGTTTCCGG 
GAGTTTCCGG 
GAGTTTCCGG 
GAGTTTCCGG 
GAGTTTCCGG 
GAGTTTCCGG 
GAGTTTCCGG 
GAGTTTCCGG 
********** 



TACTAACCAT 
TACTAACCAT 
TACTAACCAT 
TACTAACCAT 
TACTAACCAT 
TACTAACCAT 
TACTAACCAT 
TACTAACCAT 
TACTAACCAT 
********** 



CAGTAATCAg 
CAGTAATCAg 
CAGTAATCAg 
CAGTAATCAg 
CAGTAATCAg 
CAGTAATCAg 
CAGTAATCAg 
CAGTAATCAg 
CAGTAATCAa



2200 
AAGAAAATGG 
AAGAAAATGG 
AAGAAAATGG 
AAGAAAATGG 
AAGAAAATGG 
AAGAAAATGG 
AAGAAAATGG 
AAGAAAATGG 
AAGAAAATGG 



*********- **********



msal78297 
msal78297.2{ 

msal78297 
msal78297.2{ 
msa!78297 . 
msal78297. 
msal78297 . 
msal78297 . 
msal78297.2{l04 



2{104_090} 
104__18RS2l} 
2{l04_2603j 
104_CJB110} 
2{104_COH1} 
2{104_M732J 
2{104_A909} 
2{104_M781} 
_JM9130013} 
Consensus 



2201 

GTGAGGTTGA 
GTGAGGTTGA 
GTGAGGTTGA 
GTGAGGTTGA 
GTGAGGTTGA 
GTGAGGTTGA 
GTGAGGTTGA 
GTGAGGTTGA 
GTGAGGTTGA 
********** 



ATTTATTAAA 
ATTTATTAAA 
ATTTATTAAA 
ATTTATTAAA 
ATTTATTAAA 
ATTTATTAAA 
ATTTATTAAA 
ATTTATTAAA 
ATTTATTAAA 
********** 



GTTAATAAAG 
GTTAATAAAG 
GTTAATAAAG 
GTTAATAAAG 
GTTAATAAAG 
GTTAATAAAG 
GTTAATAAAG 
GTTAATAAAG 
GTTAATAAAG 
********** 



ACAAACATTC 
ACAAACATTC 
ACAAACATTC 
ACAAACATTC 
ACAAACATTC 
ACAAACATTC 
ACAAACATTC 
ACAAACATTC 
ACAAACATTC 
********** 



2250 
AGAATCGCTT 
AGAATCGCTT 
AGAATCGCTT 
AGAATCGCTT 
AGAATCGCTT 
AGAATCGCTT 
AGAATCGCTT 
AGAATCGCTT 
AGAATCGCTT 
********** 



msal78297 
msal78297.2{ 
msal78297. 
msal78297.2{ 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297. 
msal78297.2{l04 



2{104_090} 
104_18RS2l} 
2{104_2603} 
104_CJB110} 
2{104__COH1} 
2{104_M732} 
2{104_A909} 
2{104_M781} 
JM9130013} 
Consensus 



2251 

TTGGGAGCTA 
TTGGGAGCTA 
TTGGGAGCTA 
TTGGGAGCTA 
TTGGGAGCTA 
TTGGGAGCTA 
TTGGGAGCTA 
TTGGGAGCTA 
TTGGGAGCTA 



AGTTTCAACT 
AGTTTCAACT 
AGTTTCAACT 
AGTTTCAACT 
AGTTTCAACT 
AGTTTCAACT 
AGTTTCAACT 
AGTTTCAACT 
AGTTTCAACT 



TCAGATAgAA 
TCAGATAgAA 
TCAGATAgAA 
TCAGATAgAA 
TCAGATAgAA 
TCAGATAgAA 
TCAGATAgAA 
TCAGATAgAA 
TCAGATAaAA 



AAAGATTTTT 
AAAGATTTTT 
AAAGATTTTT 
AAAGATTTTT 
AAAGATTTTT 
AAAGATTTTT 
AAAGATTTTT 
AAAGATTTTT 
AAAGATTTTT 



2300 
CTGGGTATAA 
CTGGGTATAA 
CTGGGTATAA 
CTGGGTATAA 
CTGGGTATAA 
CTGGGTATAA 
CTGGGTATAA 
CTGGGTATAA 
CTGGGTATAA 



msal78297 
msal78297.2{ 

msal78297 
msal78297.2{ 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 
msal78297.2{l04 



.2{l04_090j 
104_18RS2lJ 
2{104__2603) 
104_CJB110} 
2{104_COH1} 
2{104_M732} 
2{104_A909} 
2{104_M781} 
JM9130013} 
Consensus 



2301 

GCAATTTGTT 
GCAATTTGTT 
GCAATTTGTT 
GCAATTTGTT 
GCAATTTGTT 
GCAATTTGTT 
GCAATTTGTT 
GCAATTTGTT 
GCAATTTGTT 
********** 



CCAGAGGGAA 
CCAGAGGGAA 
CCAGAGGGAA 
CCAGAGGGAA 
CCAGAGGGAA 
CCAGAGGGAA 
CCAGAGGGAA 
CCAGAGGGAA 
CCAGAGGGAA 
********** 



GTGATGTTAC
GTGATGTTAC 
GTGATGTTAC 
GTGATGTTAC 
GTGATGTTAC 
GTGATGTTAC 
GTGATGTTAC 
GTGATGTTAC 
GTGATGTTAC 
********** 



AACAAAGAAT 
AACAAAGAAT 
AACAAAGAAT 
AACAAAGAAT 
AACAAAGAAT 
AACAAAGAAT 
AACAAAGAAT 
AACAAAGAAT 
AACAAAGAAT 
********** 



2350 
GATGGTAAAA 
GATGGTAAAA 
GATGGTAAAA 
GATGGTAAAA 
GATGGTAAAA 
GATGGTAAAA 
GATGGTAAAA 
GATGGTAAAA 
GATGGTAAAA 
********** 



msal78297 
msal78297.2{ 
msal78297 . 
msal78297 .2{ 
msa!78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297.2{!04 



•2{l04_090} 
104_18RS21} 
2{104_2603} 
104_CJB110} 
2{104_COH1) 
2{104_M732} 
2{104_A909} 
2{104_M781} 
JM9130013} 
Consensus 



2351 

TTTATTTTAA 
TTTATTTTAA 
TTTATTTTAA 
TTTATTTTAA 
TTTATTTTAA 
TTTATTTTAA 
TTTATTTTAA 
TTTATTTTAA 
TTTATTTTAA 
********** 



AGCACTTCAA 
AGCACTTCAA 
AGCACTTCAA 
AGCACTTCAA 
AGCACTTCAA 
AGCACTTCAA 
AGCACTTCAA 
AGCACTTCAA 
AGCACTTCAA 
********** 



GATGGTAACT 
GATGGTAACT 
GATGGTAACT 
GATGGTAACT 
GATGGTAACT 
GATGGTAACT 
GATGGTAACT 
GATGGTAACT 
GATGGTAACT 
********** 



ATAAATTATA
ATAAATTATA 
ATAAATTATA 
ATAAATTATA 
ATAAATTATA 
ATAAATTATA 
ATAAATTATA 
ATAAATTATA 
ATAAATTATA 
********** 



2400 
TGAAATTTCA 
TGAAATTTCA 
TGAAATTTCA 
TGAAATTTCA 
TGAAATTTCA 
TGAAATTTCA 
TGAAATTTCA 
TGAAATTTCA 
TGAAATTTCA 
********** 



msal78297 .2{l04_090) 
msa!78297 .2{104_18RS21} 



2401 2450 
AGTCCAGATG GCTATATAGA GGTTAAAACG AAACCTGTTG TGACATTTAC 
AGTCCAGATG GCTATATAGA GGTTAAAACG AAACCTGTTG TGACATTTAC 



msa!78297 . 2 { 104_2603 } 
msal78297 . 2 { 104_CJB110 } 
msal78297.2{l04_COHl} 
msa!78297 . 2 { 104_M732 } 
m3al78297 . 2 ( 104_A909 ) 
msal78297 . 2 { 104_M781 } 
msal78297 ♦ 2 { 104_JM9130013 } 
Consensus 



msal78297 
msal78297 .2{ 
msal78297 . 
msal78297.2{ 
msal78297 . 
msal78297 . 
. msal78297. 
msal78297 . 
msal78297.2{l04 



.2{104_090} 
104_18RS2l} 
2{104_2603} 
104_CJB110} 
2{104_COH1} 
2{104_M732) 
2{104_A909} 
2{104_M781} 
: _JM9130013} 
Consensus 



msal78297.2{l04_090} 
msal78297 . 2 { 104_18RS2l} 
msal78297 . 2 { 104_2603 } 
msal78297 . 2 { 104_CJB110 } 
msal782 97 . 2 { 104_COH1 } 
msa!78297 . 2 ( 104_M732 } 
msal78297.2{l04_A909} 
msal78297 . 2 { 104_M781 } 
msal78297 . 2 { 104__JM9130013 } 
Consensus 



AGTCCAGATG GCTATATAGA 
AGTCCAGATG GCTATATAGA 
AGTCCAGATG GCTATATAGA 
AGTCCAGATG GCTATATAGA 
AGTCCAGATG GCTATATAGA 
AGTCCAGATG GCTATATAGA 
AGTCCAGATG GCTATATAGA 
********** 

2451 

AATTCAAAAT GGAGAAGTTA 
AATTCAAAAT GGAGAAGTTA 
AATTCAAAAT GGAGAAGTTA 
AATTCAAAAT GGAGAAGTTA 
AATTCAAAAT GGAGAAGTTA 
AATTCAAAAT GGAGAAGTTA 
AATTCAAAAT GGAGAAGTTA 
AATTCAAAAT GGAGAAGTTA 
AATTCAAAAT GGAGAAGTTA 
********** ********** 



GGTTAAAACG 
GGTTAAAACG 
GGTTAAAACG 
GGTTAAAACG 
GGTTAAAACG 
GGTTAAAACG 
GGTTAAAACG 
********** ********** 



AAACCTGTTG 
AAACCTGTTG 
AAACCTGTTG 
AAACCTGTTG 
AAACCTGTTG 
AAACCTGTTG 
AAACCTGTTG 
********** 



CGAACCTGAA 
CGAACCTGAA 
CGAACCTGAA 
CGAACCTGAA 
CGAACCTGAA 
CGAACCTGAA 
CGAACCTGAA 
CGAACCTGAA 
CGAACCTGAA 
********** 



2501 

AAAATCAAAT 
AAAATCAAAT 
AAAATCAAAT 
AAAATCAAAT 
AAAATCAAAT 
AAAATCAAAT 
AAAATCAAAT 
AAAATCAAAT 
AAAATCAAAT 
********** 



CGGGTATCTT 
CGGGTATCTT 
CGGGTATCTT 
CGGGTATCTT 
CGGGTATCTT 
CGGGTATCTT 
CGGGTATCTT 
CGGGTATCTT 
CGGGTATCTT 
********** 



GAAggaaatg 
GAAggaaatg 
GAAggaaatg 
GAAggaaatg 
GAAggaaatg 
GAAggaaatg 
GAAggaaatg 
GAAggaaatg 

GAA 

*** 



AGCAGATCCA 
AGCAGATCCA 
AGCAGATCCA 
AGCAGATCCA 
AGCAGATCCA 
AGCAGATCCA 
AGCAGATCCA 
AGCAGATCCA 
AGCAGATCCA 
********** 



gtaaacatct 
gtaaacatct 
gtaaacatct 
gtaaacatct 
gtaaacatct 
gtaaacatct 
gtaaacatct 
gtaaacatct 



TGACATTTAC 
TGACATTTAC 
TGACATTTAC 
TGACATTTAC 
TGACATTTAC 
TGACATTTAC 
TGACATTTAC 
********** 

2500 
AATGCTAATA 
AATGCTAATA 
AATGCTAATA 
AATGCTAATA 
AATGCTAATA 
AATGCTAATA 
AATGCTAATA 
AATGCTAATA 
AATGCTAATA 
********** 

2550 
tattaccaac 
tattaccaac 
tattaccaac 
tattaccaac 
tattaccaac 
tattaccaac 
tattaccaac 
tattaccaac 



msal78297.2{l04_090} 
msal78297 . 2 { 104_JL8RS21 } 
msal78297 . 2 { 104_2603 } 
msal78297 . 2 { 104_CJB110 } 
msal78297.2{l04_COHl} 
msal78297 . 2 { 104_M732 } 
msal7 82 97 - 2 { 104_A909 } 
msal78297.2{l04_M78l} 
msal78297 . 2 { 104_JM9130013 } 
Consensus 



2551 2600 

actcccaaac gcccaccagg tgtt 

actcccaaac gcccaccagg tgtt 

actcccaaac gcccaccagg tgtttttcct aaaacagggg gaattggtac 

actcccaaac gcccaccagg tgtt 

actcccaaac gcccaccagg tgtt 

actcccaaac gcccaccagg tgtt 

actcccaaac gcccaccagg tgtt 

actcccaaac gcccaccagg tgtt 



_****** ********** ********** 



msal78297 ,2{104_090) 
msal78297 . 2 {l04_18RS2l} 
msal78297.2{l04_2603} 
msal78297 - 2 { 104_CJB110 } 
msal78297.2{l04_COHl} 
rasal78297.2{l04_M732} 
msal78297.2{l04_A909} 
msal78297.2{l04_M78l} 
msal78297 . 2 { 104_JM9130013 } 
Consensus 



aattgtctat atattagttg gttctacttt tatgatactt accatttgtt 



********** ********** ********** ********** ********** 



msal78297.2{l04_090} 
msal78297 . 2 { 104_18RS2l} 
msal78297 .2{104_2603} 
msal78297 .2{104_CJB110} 
msal78297.2{l04_COHlj 
msal78297 .2{104_M732) 
msal78297 .2{104_A909} 
msal78297.2{l04_M78l} 
rasal78297 . 2 { 104_JM9130013 } 
Consensus 



2651 



2670 



ctttccgtcg taaacaattg 



********** ********** 
SEQ ID NO. 4110: SAG0649 FROM 2603 V/R GBS TYPE V STRAIN
MKKRQKIWRGLSVTLLILSQIPFGILVCGETQDTNQALGKVIVKKTGDNATPLGKATFVL
KNDNDKSETSHETVEGSGEATFENIKPGDYTLREETAPIGYKKTDKTWKVKVADNGATII
EGMDADKAEKRKEVLNAQYPKSAIYEDTKENYPLVNVEGSKVGEQYKALNPINGKDGRRE
IAEGWLSKKITGVNDLDKNKYKIELTVEGKTTVETKELNQPLDVVVLLDNSNSMNNERAN
NSQRALKAGEAVEKLIDKITSNKDNRVALVTYASTIFDGTEATVSKGVADQNGKALNDSV
SWDYHKTTFTATTHNYSYLNLTNDANEVNILKSRIPKEAEHINGDRTLYQFGATFTQKAL
MKANEILETQSSNARKKLIFHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIPDRSGI
LQEDFIINGDDYQIVKGDGESFKLFSDRRVPVTGGTTQAAYRVPQNQLSVMSNEGYAINS
GYIYLYWRDYNWVYPFDPKTKKVSATKQIKTHGEPTTLYFNGNIRPKGYDIFTVGIGVNG
DPGATPLEAEKFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHSIVDGNVTDPMG
EMIEFQLKNGQSFTHDDYVLVGNDGSQLKNGVALGGPNSDGGILKDVTVTYDKTSQTIKI
NHLNLGSGQKVVLTYDVRLKDNYISNKFYNTNNRTTLSPKSEKEPNTIRDFPIPKIRDVR
EFPVLTISNQKKMGEVEFIKVNKDKHSESLLGAKFQLQIEKDFSGYKQFVPEGSDVTTKN
DGKIYFKALQDGNYKLYEISSPDGYIEVKTKPVVTFTIQNGEVTNLKADPNANKNQIGYL
EGNGKHLITNTPKRPPGVFPKTGGIGTIVYILVGSTFMILTICSFRRKQL

SEQ ID NO. 4111: SAG0649 FROM 090 GBS TYPE Ia STRAIN

GETQDTNQALGKVIVKKTGDNATPLGKATFVLKNDNDKSETSHETVEGSGEATFENIKPG
DYTLREETAPIGYKKTDKTWKVKVADNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDT
KENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKITGVNDLDKNKYKIELTVE
GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVA
LVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEV
NILKSRIPKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPT
MSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDR
KVPVTGGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQ
IKTHGEPTTLYFNGNIRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDD
TNKIYDELNKYFKTIVEEKHSIVDGNVTDPMGEMIEFQLKNGQSFTHDDYVLVGNDGSQL
KNGVALGGPNSDGGILKDVTVTYDKTSQTIKINHLNLGSGQKVVLTYDVRLKDNYISNKF
YNTNNRTTLSPKSEKEPNTIRDFPIPKIRDVREFPVLTISNQKKMGEVEFIKVNKDKHSE
SLLGAKFQLQIEKDFSGYKQFVPEGSDVTTKNDGKIYFKALQDGNYKLYEISSPDGYIEV
KTKPVVTFTIQNGEVTNLKADPNANKNQIGYLEGNGKHLITNTPKRPPGV

SEQ ID NO. 4112: SAG0649 FROM A909 GBS TYPE Ia STRAIN

GETQDTNQALGKVIVKKTGDNATPLGKATFVLKNDNDKSETSHETVEGSGEATFENIKPG
DYTLREETAPIGYKKTDKTWKVKVADNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDT
KENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKITGVNDLDKNKYKIELTVE
GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVA
LVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEV
NILKSRIPKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPT
MSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDR
KVPVTGGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQ
IKTHGEPTTLYFNGNIRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDD
TNKIYDELNKYFKTIVEEKHSIVDGNVTDPMGEMIEFQLKNGQSFTHDDYVLVGNDGSQL
KNGVALGGPNSDGGILKDVTVTYDKTSQTIKINHLNLGSGQKVVLTYDVRLKDNYISNKF
YNTNNRTTLSPKSEKEPNTIRDFPIPKIRDVREFPVLTISNQKKMGEVEFIKVNKDKHSE
SLLGAKFQLQIEKDFSGYKQFVPEGSDVTTKNDGKIYFKALQDGNYKLYEISSPDGYIEV
KTKPVVTFTIQNGEVTNLKADPNANKNQIGYLEGNGKHLITNTPKRPPGV

SEQ ID NO. 4113: SAG0649 FROM 18RS21 GBS TYPE II STRAIN 

GETQDTNQALGKVIVKKTGDNATPLGKATFVLKNDNDKSETSHETVEGSGEATFENIKPG
DYTLREETAPIGYKKTDKTWKVKVADNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDT
KENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKITGVNDLDKNKYKIELTVE
GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVA
LVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEV
NILKSRIPKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPT
MSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDR
KVPVTGGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQ
IKTHGEPTTLYFNGNIRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDD
TNKIYDELNKYFKTIVEEKHSIVDGNVTDPMGEMIEFQLKNGQSFTHDDYVLVGNDGSQL
KNGVALGGPNSDGGILKDVTVTYDKTSQTIKINHLNLGSGQKVVLTYDVRLKDNYISNKF
YNTNNRTTLSPKSEKEPNTIRDFPIPKIRDVREFPVLTISNQKKMGEVEFIKVNKDKHSE
SLLGAKFQLQIEKDFSGYKQFVPEGSDVTTKNDGKIYFKALQDGNYKLYEISSPDGYIEV
KTKPVVTFTIQNGEVTNLKADPNANKNQIGYLEGNGKHLITNTPKRPPGV

SEQ ID NO. 4114: SAG0649 FROM M732 GBS TYPE III STRAIN 

GETQDTNQALGKVIVKKTGDNATPLGKATFVLKNDNDKSETSHETVEGSGEATFENIKPG
DYTLREETAPIGYKKTDKTWKVKVADNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDT
KENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKNTGVNDLDKNKYKIELTVE
GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVA
LVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEV
NILKSRIPKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPT
MSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDR
KVPVTGGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQ
IKTHGEPTTLYFNGNIRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDD
TNKIYDELNKYFKTIVEEKHSIVDGNVTDPMGEMIEFQLKNGQSFTHDDYVLVGNDGSQL
KNGVALGGPNSDGGILKDVTVTYDKTSQTIKINHLNLGSGQKVVLTYDVRLKDNYISNKF
YNTNNRTTLSPKSEKEPNTIRDFPIPKIRDVREFPVLTISNQKKMGEVEFIKVNKDKHSE
SLLGAKFQLQIEKDFSGYKQFVPEGSDVTTKNDGKIYFKALQDGNYKLYEISSPDGYIEV
KTKPVVTFTIQNGEVTNLKADPNANKNQIGYLEGNGKHLITNTPKRPPGV

SEQ ID NO. 4115: SAG0649 FROM COH1 GBS TYPE III STRAIN 

GETQDTNQALGKVIVKKTGDNATPLGKATFVLKNDNDKSETSHETVEGSGXATFENIKPG
DYTLREETAPIGYKKTDKTWKVKVADNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDT
KENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKNTGW
GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAV^
LVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEV
NILKSRIPKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIPHVTDGVPT
MSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDR
KVPVTGGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQ
IKTHGEPTTLYFNGNIRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDD
TNKIYDELNKYFKTIVEEKHSIVDGNVTDPMGEMIEFQLKNGQSFTHDDYVLVGNDGSQL
KNGVALGGPNSDGGILKDVTVTYDKTSQTIKINHLNLGSGQKVVLTYDVRLKDNYISNKF
YNTNNRTTLSPKSEKEPNTIRDFPIPKIRDVREFPVLTISNQKKMGEVEFIKVNKDKHSE
SLLGAKFQLQIEKDFSGYKQFVPEGSDVTTKNDGKIYFKALQDGNYKLYEISSPDGYIEV
KTKPVVTFTIQNGEVTNLKADPNANKNQIGYLEGNGKHLITNTPKRPPGV

SEQ ID NO. 4116: SAG0649 FROM M781 GBS TYPE III STRAIN

GKVIVKKTGDTATPLGKATFVLKNDNDKSETSHETVEGSGKATFENIKPGDYTLREETAP
IGYKKTDKTWKVKVADNGAXIIEGMDADKAEKRKEVLNAQYPKSAIYEDTKENYPLVNVE
GSKVGEQYKALNPINGKDGRREIAEGWLSKKITGVNDLDKNKYKIELTVEGKTTVETKEL
NQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVALVTYASTIFD
GTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEVNILKSRIPKE
AEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPTMSYAINFNPY
ISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDRKVPVTGGTTQ
AAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQIKTHGEPTTL
YFNGNIRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDDTNKIYDELNK
YFKTIVEEKHSIVDGNVTDPMGEMIEFQLKNGQSFTHDDYVLVGNDGSQLKNGVALGGPN
SDGGILKDVTVTYDKTSQTIKINHLNLGSGQKVVLTYDVRLKDNY
PKSEKEPNTIRDFPIPKIRDVREFPVLTISNQKKMGEVEFIKVNKDKHSESLLGAKFQLQ
IEKDFSGYKQFVPEGSDVTTKNDGKIYFKALQDGNYKLYEISSPDGYIEVKTKPVVTFTI
QNGEVTNLKADPNANKNQIGYLEGNGKHLITNTPKRPPGV

SEQ ID NO. 4117: SAG0649 FROM CJB110 GBS NONTYPEABLE STRAIN 

GETQDTNQALGKVIVKKTGDNATPLGKATFVLKNDNDKSETSHETVEGSGXATFENIKPG
DYTLREETAPIGYKKTDKTWKVKVADNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDT
KENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKITGVNDLDKNKYKIELTVE
GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNK^
LVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEV
NILKSRIPKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPT
MSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDR
KVPVTGGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQ
IKTHGEPTTLYFNGNIRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDD
TNKIYDELNKYFKTIVEEKHSIVDGNVTDPMGEMIEFQLKNGQSFTHDDYVLVGNDGSQL
KNGVALGGPNSDGGILKDVTVTYDKTSQTIKINHLNLGSGQKVVLTYDVRLKDNYISNKF
YNTNNRTTLSPKSEKEPNTIRDFPIPKIRDVREFPVLTISNQKKMGEVEFIKVNKDKHSE
SLLGAKFQLQIEKDFSGYKQFVPEGSDVTTKNDGKIYFKALQDGNYKLYEISSPDGYIEV
KTKPVVTFTIQNGEVTNLKADPNANKNQIGYLEGNGKHLITNTPKRPPGV

SEQ ID NO. 4118: SAG0649 FROM JM9130013 GBS TYPE VIII STRAIN 

GETQDTNQALGKVIVKKTGDNATPLGKATFVLKNDNDKSETSHETVEGSGEATFENIKPG
DYTLREETAPIGYKKTDKTWKVKVADNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDT
KENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKITGVN
GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKD
LVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEV
NILKSRIPKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPT
MSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDR
KVPVTGGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQ
IKTHGEPTTLYFNGNIRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDD
TNKIYDELNKYFKTIVEEKHSIVDGNVTDPMGEMIEFQLKNGQSFTHDDYVLVGNDGSQL
KNGVALGGPNSDGGILKDVTVTYDKTSQTIKINHLNLGSGQKVVLTYDVRLKDNYISNKF
YNTNNRTTLSPKSEKEPNTIRDFPIPKIRDVREFPVLTISNQKKMGEVEFIKVNKDKHSE
SLLGAKFQLQIKKDFSGYKQFVPEGSDVTTKNDGKIYFKALQDGNYKLYEISSPDGYIEV
KTKPVVTFTIQNGEVTNLKADPNANKNQIGYLE
PRETTY of: /biotmp/msal78371.2{*} May 12, 2003 09:25

1 50 


msal78371.2{l04 090 9* tqdtnqalGK VIVKKTGDnA 

maal78371 2{l04 18RS21) 9e tqdtnqalGK VIVKKTGDnA 

msal78371.2{l04 2603} mkkrqkiwrg lsvtllileq ipfgilvqge tqdtnqalGK VIVKKTGDnA 

msal78371.2 104-A909} 9* tqdtnqalGK mmODnJ 

msal78371.2{l04 JM9130013} 9« tqdtnqalGK VIVKKTGDnA 

~~ Consensus ********** ********** ★*★*****-- ** ********-* 

51 100 

msal78371.2{104_CJB110}     TPLGKATFVL KNDNDKSETS HETVEGSGxA TFENIKPGDY TLREETAPIG

msal78371.2{104_M781}       TPLGKATFVL KNDNDKSETS HETVEGSGkA TFENIKPGDY TLREETAPIG

msal78371.2{104_COH1}       TPLGKATFVL KNDNDKSETS HETVEGSGxA TFENIKPGDY TLREETAPIG

msal78371.2{104_M732}       TPLGKATFVL KNDNDKSETS HETVEGSGeA TFENIKPGDY TLREETAPIG

msal78371.2{104_090}        TPLGKATFVL KNDNDKSETS HETVEGSGeA TFENIKPGDY TLREETAPIG

msal78371.2{104_18RS21}     TPLGKATFVL KNDNDKSETS HETVEGSGeA TFENIKPGDY TLREETAPIG

msal78371.2{104_2603}       TPLGKATFVL KNDNDKSETS HETVEGSGeA TFENIKPGDY TLREETAPIG

msal78371.2{104_A909}       TPLGKATFVL KNDNDKSETS HETVEGSGeA TFENIKPGDY TLREETAPIG

msal78371.2{104_JM9130013}  TPLGKATFVL KNDNDKSETS HETVEGSGeA TFENIKPGDY TLREETAPIG

Consensus                   ********** ********** ********-* ********** **********

101 150 

msal78371.2{104_CJB110}     YKKTDKTWKV KVADNGAtII EGMDADKAEK RKEVLNAQYP KSAIYEDTKE

msal78371.2{104_M781}       YKKTDKTWKV KVADNGAxII EGMDADKAEK RKEVLNAQYP KSAIYEDTKE

msal78371.2{104_COH1}       YKKTDKTWKV KVADNGAtII EGMDADKAEK RKEVLNAQYP KSAIYEDTKE

msal78371.2{104_M732}       YKKTDKTWKV KVADNGAtII EGMDADKAEK RKEVLNAQYP KSAIYEDTKE

msal78371.2{104_090}        YKKTDKTWKV KVADNGAtII EGMDADKAEK RKEVLNAQYP KSAIYEDTKE

msal78371.2{104_18RS21}     YKKTDKTWKV KVADNGAtII EGMDADKAEK RKEVLNAQYP KSAIYEDTKE

msal78371.2{104_2603}       YKKTDKTWKV KVADNGAtII EGMDADKAEK RKEVLNAQYP KSAIYEDTKE

msal78371.2{104_A909}       YKKTDKTWKV KVADNGAtII EGMDADKAEK RKEVLNAQYP KSAIYEDTKE

msal78371.2{104_JM9130013}  YKKTDKTWKV KVADNGAtII EGMDADKAEK RKEVLNAQYP KSAIYEDTKE

Consensus                   ********** *******-** ********** ********** **********

151 200 

msal78371.2{104_CJB110}     NYPLVNVEGS KVGEQYKALN PINGKDGRRE IAEGWLSKKi TGVNDLDKNK

msal78371.2{104_M781}       NYPLVNVEGS KVGEQYKALN PINGKDGRRE IAEGWLSKKi TGVNDLDKNK

msal78371.2{104_COH1}       NYPLVNVEGS KVGEQYKALN PINGKDGRRE IAEGWLSKKn TGVNDLDKNK

msal78371.2{104_M732}       NYPLVNVEGS KVGEQYKALN PINGKDGRRE IAEGWLSKKn TGVNDLDKNK

msal78371.2{104_090}        NYPLVNVEGS KVGEQYKALN PINGKDGRRE IAEGWLSKKi TGVNDLDKNK

msal78371.2{104_18RS21}     NYPLVNVEGS KVGEQYKALN PINGKDGRRE IAEGWLSKKi TGVNDLDKNK

msal78371.2{104_2603}       NYPLVNVEGS KVGEQYKALN PINGKDGRRE IAEGWLSKKi TGVNDLDKNK

msal78371.2{104_A909}       NYPLVNVEGS KVGEQYKALN PINGKDGRRE IAEGWLSKKi TGVNDLDKNK

msal78371.2{104_JM9130013}  NYPLVNVEGS KVGEQYKALN PINGKDGRRE IAEGWLSKKi TGVNDLDKNK

Consensus                   ********** ********** ********** *********- **********

201 250 

msal78371.2{104_CJB110}     YKIELTVEGK TTVETKELNQ PLDVVVLLDN SNSMNNERAN NSQRALKAGE

msal78371.2{104_M781}       YKIELTVEGK TTVETKELNQ PLDVVVLLDN SNSMNNERAN NSQRALKAGE

msal78371.2{104_COH1}       YKIELTVEGK TTVETKELNQ PLDVVVLLDN SNSMNNERAN NSQRALKAGE

msal78371.2{104_M732}       YKIELTVEGK TTVETKELNQ PLDVVVLLDN SNSMNNERAN NSQRALKAGE

msal78371.2{104_090}        YKIELTVEGK TTVETKELNQ PLDVVVLLDN SNSMNNERAN NSQRALKAGE

msal78371.2{104_18RS21}     YKIELTVEGK TTVETKELNQ PLDVVVLLDN SNSMNNERAN NSQRALKAGE

msal78371.2{104_2603}       YKIELTVEGK TTVETKELNQ PLDVVVLLDN SNSMNNERAN NSQRALKAGE

msal78371.2{104_A909}       YKIELTVEGK TTVETKELNQ PLDVVVLLDN SNSMNNERAN NSQRALKAGE

msal78371.2{104_JM9130013}  YKIELTVEGK TTVETKELNQ PLDVVVLLDN SNSMNNERAN NSQRALKAGE

Consensus                   ********** ********** ********** ********** **********

251 300 

msal78371.2{104_CJB110} AVEKLIDKIT SNKDNRVALV TYASTIFDGT EATVSKGVAD QNGKALNDSV

msal78371.2{104_M781} AVEKLIDKIT SNKDNRVALV TYASTIFDGT EATVSKGVAD QNGKALNDSV

msal78371.2{104_COH1} AVEKLIDKIT SNKDNRVALV TYASTIFDGT EATVSKGVAD QNGKALNDSV

msal78371.2{104_M732} AVEKLIDKIT SNKDNRVALV TYASTIFDGT EATVSKGVAD QNGKALNDSV

msal78371.2{104_090} AVEKLIDKIT SNKDNRVALV TYASTIFDGT EATVSKGVAD QNGKALNDSV

msal78371.2{104_18RS21} AVEKLIDKIT SNKDNRVALV TYASTIFDGT EATVSKGVAD QNGKALNDSV

msal78371.2{104_2603} AVEKLIDKIT SNKDNRVALV TYASTIFDGT EATVSKGVAD QNGKALNDSV

msal78371.2{104_A909} AVEKLIDKIT SNKDNRVALV TYASTIFDGT EATVSKGVAD QNGKALNDSV

msal78371.2{104_JM9130013} AVEKLIDKIT SNKDNRVALV TYASTIFDGT EATVSKGVAD QNGKALNDSV

Consensus ********** ********** ********** ********** **********

301 350 

msal78371.2{104_CJB110} SWDYHKTTFT ATTHNYSYLN LTNDANEVNI LKSRIPKEAE HINGDRTLYQ

msal78371.2{104_M781} SWDYHKTTFT ATTHNYSYLN LTNDANEVNI LKSRIPKEAE HINGDRTLYQ

msal78371.2{104_COH1} SWDYHKTTFT ATTHNYSYLN LTNDANEVNI LKSRIPKEAE HINGDRTLYQ

msal78371.2{104_M732} SWDYHKTTFT ATTHNYSYLN LTNDANEVNI LKSRIPKEAE HINGDRTLYQ

msal78371.2{104_090} SWDYHKTTFT ATTHNYSYLN LTNDANEVNI LKSRIPKEAE HINGDRTLYQ

msal78371.2{104_18RS21} SWDYHKTTFT ATTHNYSYLN LTNDANEVNI LKSRIPKEAE HINGDRTLYQ

msal78371.2{104_2603} SWDYHKTTFT ATTHNYSYLN LTNDANEVNI LKSRIPKEAE HINGDRTLYQ

msal78371.2{104_A909} SWDYHKTTFT ATTHNYSYLN LTNDANEVNI LKSRIPKEAE HINGDRTLYQ

msal78371.2{104_JM9130013} SWDYHKTTFT ATTHNYSYLN LTNDANEVNI LKSRIPKEAE HINGDRTLYQ

Consensus ********** ********** ********** ********** **********



351 400

msal78371.2{104_CJB110} FGATFTQKAL MKANEILETQ SSNARKKLIF HVTDGVPTMS YAINFNPYIS

msal78371.2{104_M781} FGATFTQKAL MKANEILETQ SSNARKKLIF HVTDGVPTMS YAINFNPYIS

msal78371.2{104_COH1} FGATFTQKAL MKANEILETQ SSNARKKLIF HVTDGVPTMS YAINFNPYIS

msal78371.2{104_M732} FGATFTQKAL MKANEILETQ SSNARKKLIF HVTDGVPTMS YAINFNPYIS

msal78371.2{104_090} FGATFTQKAL MKANEILETQ SSNARKKLIF HVTDGVPTMS YAINFNPYIS

msal78371.2{104_18RS21} FGATFTQKAL MKANEILETQ SSNARKKLIF HVTDGVPTMS YAINFNPYIS

msal78371.2{104_2603} FGATFTQKAL MKANEILETQ SSNARKKLIF HVTDGVPTMS YAINFNPYIS

msal78371.2{104_A909} FGATFTQKAL MKANEILETQ SSNARKKLIF HVTDGVPTMS YAINFNPYIS

msal78371.2{104_JM9130013} FGATFTQKAL MKANEILETQ SSNARKKLIF HVTDGVPTMS YAINFNPYIS

Consensus ********** ********** ********** ********** **********



401 450 

msal78371.2{104_CJB110} TSYQNQFNSF LNKIPDRSGI LQEDFIINGD DYQIVKGDGE SFKLFSDRKV

msal78371.2{104_M781} TSYQNQFNSF LNKIPDRSGI LQEDFIINGD DYQIVKGDGE SFKLFSDRKV

msal78371.2{104_COH1} TSYQNQFNSF LNKIPDRSGI LQEDFIINGD DYQIVKGDGE SFKLFSDRKV

msal78371.2{104_M732} TSYQNQFNSF LNKIPDRSGI LQEDFIINGD DYQIVKGDGE SFKLFSDRKV

msal78371.2{104_090} TSYQNQFNSF LNKIPDRSGI LQEDFIINGD DYQIVKGDGE SFKLFSDRKV

msal78371.2{104_18RS21} TSYQNQFNSF LNKIPDRSGI LQEDFIINGD DYQIVKGDGE SFKLFSDRKV

msal78371.2{104_2603} TSYQNQFNSF LNKIPDRSGI LQEDFIINGD DYQIVKGDGE SFKLFSDRKV

msal78371.2{104_A909} TSYQNQFNSF LNKIPDRSGI LQEDFIINGD DYQIVKGDGE SFKLFSDRKV

msal78371.2{104_JM9130013} TSYQNQFNSF LNKIPDRSGI LQEDFIINGD DYQIVKGDGE SFKLFSDRKV

Consensus ********** ********** ********** ********** **********



451 500

msal78371.2{104_CJB110} PVTGGTTQAA YRVPQNQLSV MSNEGYAINS GYIYLYWRDY NWVYPFDPKT

msal78371.2{104_M781} PVTGGTTQAA YRVPQNQLSV MSNEGYAINS GYIYLYWRDY NWVYPFDPKT

msal78371.2{104_COH1} PVTGGTTQAA YRVPQNQLSV MSNEGYAINS GYIYLYWRDY NWVYPFDPKT

msal78371.2{104_M732} PVTGGTTQAA YRVPQNQLSV MSNEGYAINS GYIYLYWRDY NWVYPFDPKT

msal78371.2{104_090} PVTGGTTQAA YRVPQNQLSV MSNEGYAINS GYIYLYWRDY NWVYPFDPKT

msal78371.2{104_18RS21} PVTGGTTQAA YRVPQNQLSV MSNEGYAINS GYIYLYWRDY NWVYPFDPKT

msal78371.2{104_2603} PVTGGTTQAA YRVPQNQLSV MSNEGYAINS GYIYLYWRDY NWVYPFDPKT

msal78371.2{104_A909} PVTGGTTQAA YRVPQNQLSV MSNEGYAINS GYIYLYWRDY NWVYPFDPKT

msal78371.2{104_JM9130013} PVTGGTTQAA YRVPQNQLSV MSNEGYAINS GYIYLYWRDY NWVYPFDPKT

Consensus ********** ********** ********** ********** **********



501 550

msal78371.2{104_CJB110} KKVSATKQIK THGEPTTLYF NGNIRPKGYD IFTVGIGVNG DPGATPLEAE

msal78371.2{104_M781} KKVSATKQIK THGEPTTLYF NGNIRPKGYD IFTVGIGVNG DPGATPLEAE

msal78371.2{104_COH1} KKVSATKQIK THGEPTTLYF NGNIRPKGYD IFTVGIGVNG DPGATPLEAE

msal78371.2{104_M732} KKVSATKQIK THGEPTTLYF NGNIRPKGYD IFTVGIGVNG DPGATPLEAE

msal78371.2{104_090} KKVSATKQIK THGEPTTLYF NGNIRPKGYD IFTVGIGVNG DPGATPLEAE

msal78371.2{104_18RS21} KKVSATKQIK THGEPTTLYF NGNIRPKGYD IFTVGIGVNG DPGATPLEAE

msal78371.2{104_2603} KKVSATKQIK THGEPTTLYF NGNIRPKGYD IFTVGIGVNG DPGATPLEAE

msal78371.2{104_A909} KKVSATKQIK THGEPTTLYF NGNIRPKGYD IFTVGIGVNG DPGATPLEAE

msal78371.2{104_JM9130013} KKVSATKQIK THGEPTTLYF NGNIRPKGYD IFTVGIGVNG DPGATPLEAE

Consensus ********** ********** ********** ********** **********



551 600

msal78371.2{104_CJB110} KFMQSISSKT ENYTNVDDTN KIYDELNKYF KTIVEEKHSI VDGNVTDPMG

msal78371.2{104_M781} KFMQSISSKT ENYTNVDDTN KIYDELNKYF KTIVEEKHSI VDGNVTDPMG

msal78371.2{104_COH1} KFMQSISSKT ENYTNVDDTN KIYDELNKYF KTIVEEKHSI VDGNVTDPMG

msal78371.2{104_M732} KFMQSISSKT ENYTNVDDTN KIYDELNKYF KTIVEEKHSI VDGNVTDPMG

msal78371.2{104_090} KFMQSISSKT ENYTNVDDTN KIYDELNKYF KTIVEEKHSI VDGNVTDPMG

msal78371.2{104_18RS21} KFMQSISSKT ENYTNVDDTN KIYDELNKYF KTIVEEKHSI VDGNVTDPMG

msal78371.2{104_2603} KFMQSISSKT ENYTNVDDTN KIYDELNKYF KTIVEEKHSI VDGNVTDPMG

msal78371.2{104_A909} KFMQSISSKT ENYTNVDDTN KIYDELNKYF KTIVEEKHSI VDGNVTDPMG

msal78371.2{104_JM9130013} KFMQSISSKT ENYTNVDDTN KIYDELNKYF KTIVEEKHSI VDGNVTDPMG

Consensus ********** ********** ********** ********** **********



601 650

msal78371.2{104_CJB110} EMIEFQLKNG QSFTHDDYVL VGNDGSQLKN GVALGGPNSD GGILKDVTVT

msal78371.2{104_M781} EMIEFQLKNG QSFTHDDYVL VGNDGSQLKN GVALGGPNSD GGILKDVTVT

msal78371.2{104_COH1} EMIEFQLKNG QSFTHDDYVL VGNDGSQLKN GVALGGPNSD GGILKDVTVT

msal78371.2{104_M732} EMIEFQLKNG QSFTHDDYVL VGNDGSQLKN GVALGGPNSD GGILKDVTVT

msal78371.2{104_090} EMIEFQLKNG QSFTHDDYVL VGNDGSQLKN GVALGGPNSD GGILKDVTVT

msal78371.2{104_18RS21} EMIEFQLKNG QSFTHDDYVL VGNDGSQLKN GVALGGPNSD GGILKDVTVT

msal78371.2{104_2603} EMIEFQLKNG QSFTHDDYVL VGNDGSQLKN GVALGGPNSD GGILKDVTVT

msal78371.2{104_A909} EMIEFQLKNG QSFTHDDYVL VGNDGSQLKN GVALGGPNSD GGILKDVTVT

msal78371.2{104_JM9130013} EMIEFQLKNG QSFTHDDYVL VGNDGSQLKN GVALGGPNSD GGILKDVTVT

Consensus ********** ********** ********** ********** **********



651 700 

msal78371.2{104_CJB110} YDKTSQTIKI NHLNLGSGQK VVLTYDVRLK DNYISNKFYN TNNRTTLSPK

msal78371.2{104_M781} YDKTSQTIKI NHLNLGSGQK VVLTYDVRLK DNYISNKFYN TNNRTTLSPK

msal78371.2{104_COH1} YDKTSQTIKI NHLNLGSGQK VVLTYDVRLK DNYISNKFYN TNNRTTLSPK

msal78371.2{104_M732} YDKTSQTIKI NHLNLGSGQK VVLTYDVRLK DNYISNKFYN TNNRTTLSPK

msal78371.2{104_090} YDKTSQTIKI NHLNLGSGQK VVLTYDVRLK DNYISNKFYN TNNRTTLSPK

msal78371.2{104_18RS21} YDKTSQTIKI NHLNLGSGQK VVLTYDVRLK DNYISNKFYN TNNRTTLSPK

msal78371.2{104_2603} YDKTSQTIKI NHLNLGSGQK VVLTYDVRLK DNYISNKFYN TNNRTTLSPK

msal78371.2{104_A909} YDKTSQTIKI NHLNLGSGQK VVLTYDVRLK DNYISNKFYN TNNRTTLSPK

msal78371.2{104_JM9130013} YDKTSQTIKI NHLNLGSGQK VVLTYDVRLK DNYISNKFYN TNNRTTLSPK

Consensus ********** ********** ********** ********** **********



701 750

msal78371.2{104_CJB110} SEKEPNTIRD FPIPKIRDVR EFPVLTISNQ KKMGEVEFIK VNKDKHSESL

msal78371.2{104_M781} SEKEPNTIRD FPIPKIRDVR EFPVLTISNQ KKMGEVEFIK VNKDKHSESL

msal78371.2{104_COH1} SEKEPNTIRD FPIPKIRDVR EFPVLTISNQ KKMGEVEFIK VNKDKHSESL

msal78371.2{104_M732} SEKEPNTIRD FPIPKIRDVR EFPVLTISNQ KKMGEVEFIK VNKDKHSESL

msal78371.2{104_090} SEKEPNTIRD FPIPKIRDVR EFPVLTISNQ KKMGEVEFIK VNKDKHSESL

msal78371.2{104_18RS21} SEKEPNTIRD FPIPKIRDVR EFPVLTISNQ KKMGEVEFIK VNKDKHSESL

msal78371.2{104_2603} SEKEPNTIRD FPIPKIRDVR EFPVLTISNQ KKMGEVEFIK VNKDKHSESL

msal78371.2{104_A909} SEKEPNTIRD FPIPKIRDVR EFPVLTISNQ KKMGEVEFIK VNKDKHSESL

msal78371.2{104_JM9130013} SEKEPNTIRD FPIPKIRDVR EFPVLTISNQ KKMGEVEFIK VNKDKHSESL

Consensus ********** ********** ********** ********** **********



751 800

msal78371.2{104_CJB110} LGAKFQLQIe KDFSGYKQFV PEGSDVTTKN DGKIYFKALQ DGNYKLYEIS

msal78371.2{104_M781} LGAKFQLQIe KDFSGYKQFV PEGSDVTTKN DGKIYFKALQ DGNYKLYEIS

msal78371.2{104_COH1} LGAKFQLQIe KDFSGYKQFV PEGSDVTTKN DGKIYFKALQ DGNYKLYEIS

msal78371.2{104_M732} LGAKFQLQIe KDFSGYKQFV PEGSDVTTKN DGKIYFKALQ DGNYKLYEIS

msal78371.2{104_090} LGAKFQLQIe KDFSGYKQFV PEGSDVTTKN DGKIYFKALQ DGNYKLYEIS

msal78371.2{104_18RS21} LGAKFQLQIe KDFSGYKQFV PEGSDVTTKN DGKIYFKALQ DGNYKLYEIS

msal78371.2{104_2603} LGAKFQLQIe KDFSGYKQFV PEGSDVTTKN DGKIYFKALQ DGNYKLYEIS

msal78371.2{104_A909} LGAKFQLQIe KDFSGYKQFV PEGSDVTTKN DGKIYFKALQ DGNYKLYEIS

msal78371.2{104_JM9130013} LGAKFQLQIk KDFSGYKQFV PEGSDVTTKN DGKIYFKALQ DGNYKLYEIS

Consensus ********** ********** ********** ********** **********

801 850

msal78371.2{104_CJB110} SPDGYIEVKT KPVVTFTIQN GEVTNLKADP NANKNQIGYL Egngkhlitn

msal78371.2{104_M781} SPDGYIEVKT KPVVTFTIQN GEVTNLKADP NANKNQIGYL Egngkhlitn

msal78371.2{104_COH1} SPDGYIEVKT KPVVTFTIQN GEVTNLKADP NANKNQIGYL Egngkhlitn

msal78371.2{104_M732} SPDGYIEVKT KPVVTFTIQN GEVTNLKADP NANKNQIGYL Egngkhlitn

msal78371.2{104_090} SPDGYIEVKT KPVVTFTIQN GEVTNLKADP NANKNQIGYL Egngkhlitn

msal78371.2{104_18RS21} SPDGYIEVKT KPVVTFTIQN GEVTNLKADP NANKNQIGYL Egngkhlitn

msal78371.2{104_2603} SPDGYIEVKT KPVVTFTIQN GEVTNLKADP NANKNQIGYL Egngkhlitn

msal78371.2{104_A909} SPDGYIEVKT KPVVTFTIQN GEVTNLKADP NANKNQIGYL Egngkhlitn

msal78371.2{104_JM9130013} SPDGYIEVKT KPVVTFTIQN GEVTNLKADP NANKNQIGYL E

Consensus ********** ********** ********** **********



851 890 

msal78371.2{104_CJB110} tpkrppgv

msal78371.2{104_M781} tpkrppgv

msal78371.2{104_COH1} tpkrppgv

msal78371.2{104_M732} tpkrppgv

msal78371.2{104_090} tpkrppgv

msal78371.2{104_18RS21} tpkrppgv

msal78371.2{104_2603} tpkrppgvfp ktggigtivy ilvgstfmil ticsfrrkql

msal78371.2{104_A909} tpkrppgv

msal78371.2{104_JM9130013}

Consensus ** ********** ********** ********** 